METHOD AND REAGENT FOR THE INHIBITION OF TELOMERASE ENZYME 

[0001] This application is a continuation of U.S. Application No. 09/653,225 filed 
August 31, 2000 which claims the benefit of U.S. Provisional Application No. 60/151,713 
filed on August 31, 1999 and U.S. Provisional Application No. 60/197,769 filed on April 
14, 2000. All of the applications are incorporated by reference herein in their entireties, 
including the drawings. 

[0002] The Sequence Listing file named "MBHB00,882-C SequenceListing.txt" 
(1,187,852 bytes in size) submitted in duplicate on Compact Disc-Recordable (CD-R) 
medium ("01091 3_1 300") in compliance with 37 C.F.R. §1.52(e) is incorporated herein 
by reference. 

Background Of The Invention 

[0003] The present invention concerns compounds, compositions, and methods for the 
study, diagnosis, and treatment of conditions and diseases related to the level of telomerase 
enzyme. 

[0004] The following is a brief description of the current understanding in the biology of 
telomerase and its components. The discussion is not meant to be complete and is 
provided only for understanding of the invention that follows. The summary is not an 
admission that any of the work described below is prior art to the claimed invention. 

[0005] The ribonucleoprotein enzyme telomerase consists of an RNA template subunit 
and one or more protein subunits including telomerase reverse transcriptase (TERT), which 
function together to direct the synthesis of telomeres. Telomeres exist as non-nucleosome 
DNA/protein complexes at the physical ends of eukaryotic chromosomes. These capping 
structures maintain chromosome stability and replicative potential (Zakian, V. A., 1995, 
Science, 270, 1601-1607). Telomere structure is characterized by tandem repeats of 
conserved DNA sequences rich in G-C base pairs. Additional conserved telomere elements 
include a terminal 3'-overhang in the G-rich strand and non-histone structural proteins that are 
complexed with telomeric DNA in the nucleus (Blackburn, E., 1990, JBC, 265, 5919-5921). 
Observed shortening of telomeres coincides with the onset of cellular senescence 
in most somatic cell lines lacking significant levels of telomerase. This finding has had a 
profound impact on our views concerning the mechanisms of aging, age related disease, 
and cancer. 

Express Mail Label No. EV333565533US 
Application Filed On November 13, 2003 
MBHB00-882-) (400/118) 

[0006] Conventional DNA polymerases are unable to fully replicate the ends of linear 
chromosomes (Watson, J. D., 1972, Nature, 239, 197-201). This inability stems from the 3' 
G-rich overhang that is a product of ribonuclease cleavage of the RNA primer used in DNA 
replication. The overhang prevents DNA polymerase replication since the recessed C-rich 
parent strand cannot be used as a template. Telomerase overcomes this limitation by 
extending the 3' end of the chromosome using deoxyribonucleotides as substrates and a 
sequence within the telomerase RNA subunit as a template (Lingner, J., 1995, Science, 
269, 1533-1534). As such, telomerase is considered a reverse transcriptase that is 
responsible for telomere maintenance. 

[0007] Telomerase was first discovered in Tetrahymena thermophila in 1985 (Greider, 
C. W., 1985, Cell, 43, 405-413). The RNA subunits and their respective genes were later 
discovered and characterized in protozoa, budding yeast, and mammals. Genetic studies of 
these genes confirmed the role of telomerase RNA (TR) in determining telomere sequence by 
mutating genes which encode the telomeric RNA (Yu, G. L., 1990, Nature, 344, 125-132), 
(Singer, M. S., 1994, Science, 266, 404-409), (Blasco, M. A., 1995, Science, 269, 1267- 
1270). These studies showed that telomerase activity parallels TR expression in protozoa, 
yeast and mice. However, the expression of human telomerase RNA (hTR) does not 
correlate well with telomerase activity in mammalian cells. Many human tissues express hTR 
but are devoid of telomerase activity (Feng, J., 1995, Science, 269, 1236-1241). Knockout 
mice, in which the mTR gene has been deleted from germline cells, have been shown to be 
viable for at least six generations. Cells from later generations of these mice showed 
chromosomal abnormalities consistent with telomere degradation, indicating that mTR is 
necessary for telomere length maintenance, but is not required for embryonic development, 
oncogenic transformation, or tumor formation in mice (Blasco, M. A., 1997, Cell, 91, 25-34). 

[0008] The first catalytically active subunit of telomerase (p123) was isolated from 
Euplotes aediculatus along with another subunit (p43) and a 66-kD RNA subunit (Lingner, J., 
1996, Proc. Natl. Acad. Sci., 93, 10712-10717). Subsequent studies revealed telomerase 
catalytic subunit homologs from fission yeast (Est2p) and human genes (TRT1). The human 
homolog, TRT1 encoding hTERT, expressed mRNA with a strong correlation to telomerase 
activity in human cells (Nakamura, T. M., 1997, Science, 277, 955-959). Reconstitution of 
telomerase activity with in vitro transcribed and translated hTERT and hTR, either co- 
synthesized or simply mixed, demonstrated that hTERT and hTR represent the minimal 
components of telomerase. Furthermore, transient expression of hTERT in normal diploid 
human cells restored telomerase activity, demonstrating that hTERT is the only component 
necessary to restore telomerase activity in normal human cells (Weinrich, S. L., 1997, 
Nature Genetics, 17, 498-502). The introduction of telomerase into normal human cells 
using hTERT expression via transfection has resulted in the extension of life span in these 
cells. Such findings indicate that telomere loss in the absence of telomerase is the "mitotic 
clock" that controls the replicative potential of a cell prior to senescence (Bodnar, A. G., 
1998, Science, 279, 349-352). 

[0009] Expression of telomerase is observed in germ cell and most cancer cell lines. 
These "immortal" cell lines continue to divide without shortening of their telomeres (Kim, N. 
W., 1994, Science, 266, 2011-2015). A model of tumor progression has evolved from 
these findings, suggesting a role for telomerase expression in malignant transformation. 
Successful malignant transformation in human cells was accomplished for the first time by 
ectopic expression of hTERT in combination with two oncogenes, SV40 large-T and H-ras. 
Injection of nude mice with cells expressing these oncogenes and hTERT resulted in rapid 
growth of tumors. These observations indicate that hTERT-mediated telomere maintenance 
is essential for the formation of human tumor cells (Hahn, W. C., 1999, Nature, 400, 464- 
468). 

[0010] Various methods have been developed to assay telomerase activity in vitro. The 
most widely used method to characterize telomerase activity is the telomeric repeat 
amplification protocol (TRAP). TRAP utilizes RT-PCR of cellular extracts to measure 
telomerase activity by making the amount of PCR target dependent upon the biochemical 
activity of the enzyme (Kim, N. W., 1997, Nucleic Acids Research, 25, 2595-2597). 

[0011] A variety of animal models have been designed to assay telomerase activity in 
vivo. Inhibition of telomerase activity has been analyzed in rats via cell proliferation studies 
with MNU (N-methyl-N-nitrosourea) induced mammary carcinomas in response to treatment 
with 4-(hydroxyphenyl)retinamide (4-HPR), a known inhibitor of mammary carcinogenesis in 
animal models and premenopausal women (Bednarek, A., 1999, Carcinogenesis, 20, 879- 
883). Additional studies have focused on the up-regulation of telomerase in transformed cell 
lines from animal and human model systems (Zhang, P. B., 1998, Leuk. Res., 22, 509-516), 
(Chadeneau, C., 1995, Oncogene, 11, 893-898), (Greenberg, P., 1999, Oncogene, 18, 
1219-1226). 

[0012] Human cell culture studies have been established to assay inhibition of telomerase 
activity in human carcinomas responding to various therapeutics. A human breast cancer 
model for studying telomerase inhibitors is described (Raymond, E., 1999, Br. J. Cancer, 
80, 1332-1341). Human studies of telomerase expression as related to various other 
cancers are described including cervical cancer (Nakano, K., 1998, Am. J. Pathol., 153, 
857-864), endometrial cancer (Kyo, S., 1999, Int. J. Cancer, 80, 60-63), meningeal 
carcinoma (Kleinschmidt-DeMasters, B. K., 1998, J. Neurol. Sci., 161, 124-134), lung 
carcinoma (Yashima, K., 1997, Cancer Research, 57, 2372-2377), testicular cancer in 
response to cisplatin (Burger, A. M., 1997, Eur. J. Cancer, 33, 638-644), and ovarian 
carcinoma (Counter, C. M., 1994, Proc. Natl. Acad. Sci., 91, 2900-2904). 

[0013] Particular degenerative and disease states that can be associated with 
telomerase expression modulation include but are not limited to: 

• Cancer: Almost all human tumors have detectable telomerase activity (Shay, J. W., 
1997, Eur. J. Cancer, 33, 787-791). Treatment with telomerase inhibitors may 
provide effective cancer therapy with minimal side effects in normal somatic cells that 
lack telomerase activity. The therapeutic potential exists for the treatment of a wide 
variety of cancer types. 

• Restenosis: Telomerase inhibition in vascular smooth muscle cells may inhibit 
restenosis by limiting proliferation of these cells. 

• Infectious disease: Telomerase inhibition in infectious cell types that express 
telomerase activity may provide selective anti-infectious agent activity. Such 
treatment may prove especially effective in protozoan-based infections such as giardiasis 
and leishmaniasis. 

• Transplant rejection: Telomerase inhibition in endothelial cell types may demonstrate 
selective immunosuppressant activity. Activation of telomerase in transplant cells 
could benefit grafting success through increased proliferative potential. 

• Autoimmune disease: Telomerase modulation in various immune cells may prove 
beneficial in treating diseases such as multiple sclerosis, lupus, and AIDS. 

• Age related disease: Activation of telomerase expression in cells at or nearing 
senescence as a result of advanced age or premature aging could benefit conditions 
such as macular degeneration, skin ulceration, and rheumatoid arthritis. 

[0014] The present body of knowledge in telomerase research indicates the need for 
methods to assay telomerase activity and for compounds that can regulate telomerase 
expression for research, diagnostic, trait alteration, animal health and therapeutic use. 

[0015] Gaeta et al., U.S. Patent Nos. 5,760,062; 5,767,278; and 5,770,613, have described 
small molecule inhibitors of the human telomerase RNA (hTR) subunit. 

[0016] Blasco et al., 1995, Science, 269, 1267-1270 describe the synthesis and testing 
of antisense oligonucleotides targeted against a specific region of the mouse telomerase 
RNA (mTR) subunit and reported reduction in telomerase activity in mice. 

[0017] Bisoffi et al., 1998, Eur. J. Cancer, 34, 1242-1249 have studied the down 
regulation of human telomerase activity by a retrovirus vector expressing antisense RNA 
targeted against the hTR RNA. 

[0018] Norton et al., 1996, Nature Biotechnology, 14, 615-619 have reported the use of 
a peptide nucleic acid (PNA) molecule targeting hTR RNA to down regulate telomerase 
activity in human immortal breast epithelial cells. 

[0019] Yokoyama et al., 1998, Cancer Research, 58, 5406-5410 have reported the 
synthesis and testing of hammerhead ribozyme constructs targeting hTR RNA resulting in a 
decrease in the telomerase activity in Ishikawa cells. 

[0020] Henderson, European Patent Application No. 666,313-A2 describes methods of 
identifying and cloning the hTR gene for use in gene therapy approaches for creating aberrant 
telomeric sequences in transfected human tumor cells. A ribozyme based gene therapy 
approach to inhibit the expression of hTR gene is described as well. The intended result of 
such therapies involves incurred genetic instability based on non-native telomeric sequences 
resulting in rapid cell death of the treated cells. 

[0021] West et al., U.S. Patent No. 5,489,508 describe methods for determining telomere 
length and telomerase activity in cells. Inhibitors of hTR RNA, including oligonucleotides 
and/or small molecules are described. 

[0022] These foregoing approaches of targeting the telomerase RNA subunit (TR) may 
not be very beneficial, because as demonstrated by Feng et al. (Feng, J., 1995, Science, 
269, 1236-1241), telomerase activity in humans does not correlate well to hTR 
concentration. 

[0023] Collins et al., International PCT publication No. WO 98/01542 describe assays 
for the detection of telomerase activity. Four human telomerase subunit proteins are 
described, called p140, p105, p48 and p43. In addition, hybridization probes and primers 
are described as inhibitors of telomerase gene function. Antibody based inhibitors of 
telomerase protein subunits are described. 

[0024] A more attractive approach to telomerase regulation would involve the regulation 
of human telomerase by modulating the expression of the protein subunits of the enzyme, 
preferably the reverse transcriptase (hTERT) subunit. Based on reconstitution experiments, 
hTERT and hTR represent the minimal components of telomerase. Since hTR expression 
does not correlate well with telomerase activity in human cells and since many human cells 
express hTR without telomerase activity, targeting hTERT may prove more beneficial than 
targeting hTR. hTERT is the only component necessary to restore telomerase activity in 
normal human cells. A study in which the three major subunits of telomerase (hTR, TP1, and 
hTERT) were assayed in normal and malignant endometrial tissues determined that hTERT is 
a rate limiting determinant of enzymatic activity of human telomerase (Kyo, S., 1999, Int. J. 
Cancer, 80, 60-63). Additional protein subunits that have been isolated most likely serve 
only a structural role in telomerase activity, but may be important in enhancing the activity of 
the telomerase enzyme. As such, hTERT is one of the better targets for the ectopic 
regulation of telomerase activity. 

[0025] Cech et al., International PCT publication No. WO 98/14593 describe 
compositions and methods related to hTERT for diagnosis, prognosis and treatment of 
human diseases, for altering proliferative capacity in cells and organisms, and for screening 
compounds and treatments with potential use as human therapeutics. 

[0026] Cech et al., International PCT publication No. WO 98/14592 describe nucleic acid 
and amino acid sequences encoding various telomerase protein subunits and motifs of 
Euplotes aediculatus, and related sequences from Schizosaccharomyces, Saccharomyces, 
and human telomerase. The polypeptides comprising telomeric subunits and 
functional polypeptides and ribonucleoproteins that contain these subunits are described as 
well. Cech et al., International PCT Publication No. WO 98/14592, mention in general terms 
the possibility of using antisense and ribozymes to down regulate the expression of 
human telomerase reverse transcriptase enzyme. 

Summary Of The Invention 

[0027] The invention features novel nucleic acid-based techniques [e.g., enzymatic nucleic 
acid molecules (ribozymes), antisense nucleic acids, 2-5A antisense chimeras, triplex DNA, 
antisense nucleic acids containing RNA cleaving chemical groups (Cook et al., U.S. Patent 
5,359,051)] and methods for their use to down regulate or inhibit the expression of 
telomerase enzyme. 

[0028] In a preferred embodiment, the invention features use of one or more of the 
nucleic acid-based techniques to inhibit the expression of the genes encoding the protein 
subunits of the telomerase enzyme, preferably the catalytic subunit of the telomerase 
enzyme. Specifically, the invention features the use of nucleic acid-based techniques to 
specifically inhibit the expression of telomerase reverse transcriptase (TERT) gene. 

[0029] In another preferred embodiment, the invention features the use of an enzymatic 
nucleic acid molecule, preferably in the hammerhead, NCH, G-cleaver and/or DNAzyme 
motif, to inhibit the expression of the TERT gene. 

[0030] In another preferred embodiment, the invention features the inhibition or down 
regulation of telomerase activity by inhibiting or down regulating the expression of one or 
more activators of telomerase enzyme, such as the protein encoded by the ras gene. Such 
activator gene expression may be regulated by the use of nucleic acid-based techniques, 
such as enzymatic nucleic acid molecules and antisense oligonucleotides. 

[0031] By "inhibit" it is meant that the activity of telomerase enzyme or level of RNAs or 
equivalent RNAs encoding one or more protein subunits of the telomerase enzyme is 
reduced below that observed in the absence of the nucleic acid. In one embodiment, 
inhibition with enzymatic nucleic acid molecule preferably is below that level observed in the 
presence of an enzymatically inactive or attenuated molecule that is able to bind to the same 
site on the target RNA, but is unable to cleave that RNA. In another embodiment, inhibition 
with antisense oligonucleotides is preferably below that level observed in the presence of, for 
example, an oligonucleotide with scrambled sequence or with mismatches. In another 
embodiment, inhibition of TERT genes with the nucleic acid molecule of the instant invention 
is greater in the presence of the nucleic acid molecule than in its absence. According to 
the invention, the activity of telomerase enzyme or the level of RNA encoding one or more 
protein subunits of the telomerase enzyme is inhibited if it is at least 10% less, 20% less, 
50% less, 75% less or even not active or present at all, in the presence of a nucleic acid of 
the invention relative to the level in the absence of such a nucleic acid. 
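
The threshold definition of "inhibited" above can be sketched as a short calculation. This is only an illustrative sketch: the function names, the default 10% threshold argument, and the example measurement values are assumptions introduced here, not part of the specification.

```python
def percent_reduction(level_with, level_without):
    """Percent by which the level measured with the nucleic acid
    falls below the level measured without it."""
    if level_without <= 0:
        raise ValueError("reference level must be positive")
    return 100.0 * (level_without - level_with) / level_without


def is_inhibited(level_with, level_without, threshold_pct=10.0):
    """True if the reduction meets the chosen threshold
    (the text lists 10%, 20%, 50%, 75%, or complete absence)."""
    return percent_reduction(level_with, level_without) >= threshold_pct


# Hypothetical readings in arbitrary units: 80 with the nucleic acid, 100 without.
print(percent_reduction(80, 100))             # 20.0
print(is_inhibited(80, 100, threshold_pct=20.0))
```

The same comparison could be made against an inactive or scrambled control instead of the untreated level, by passing the control reading as the reference.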

[0032] As used herein, the term "telomerase activity" refers to enzyme activity that 
replicates, for example, the TTAGGG repeats at the ends of linear chromosomes. 
Telomerase activity is comprised by a ribonucleoprotein enzyme comprising one or more 
protein subunits and an RNA subunit. The enzymatic activity extends the 5'-recessed end of 
a linear chromosome using deoxyribonucleotides and an RNA sequence within the RNA 
subunit as a primer. Telomerase activity may be assayed as follows. Samples to be 
assayed for telomerase activity are prepared by extraction into CHAPS lysis buffer (10 mM 
Tris pH 7.5, 1 mM MgCl2, 1 mM EGTA, 0.1 mM PMSF, 5 mM β-mercaptoethanol, 1 mM DTT, 
0.5% 3-[(3-cholamidopropyl)-dimethyl-amino]-1-propanesulfonate (CHAPS), 10% glycerol and 
40 U/ml RNase inhibitor (Promega, Madison, WI, U.S.A.)). Cells are suspended in CHAPS lysis 
buffer and incubated on ice for 30 minutes, which allows lysis of 90-100% of cells. Lysate is 
then transferred to polyallomer centrifuge tubes and spun at 100,000 x g for 1 hour at 4 
degrees C. The supernatant is the protein extract, and concentration ranges of 4-10 µg/µl 
are suitable for telomerase assay. Extracts may be concentrated if necessary using a 
Microcon Microfilter 30 (Amicon, Beverly, MA, U.S.A.) according to the manufacturer's 
instructions. Extracts may be stored frozen at -80 degrees C until assayed. 

[0033] Telomerase may be assayed according to Kim and Wu, Nucl. Acids Res. 25: 
2595-2597, incorporated herein by reference. Briefly, for the telomerase assay, 2 µg of 
protein extract is used. The extract is assayed in 50 µl of reaction mixture containing 0.1 µg 
TS substrate primer (5'-AATCCGTCGAGCAGAGTT-3' (SEQ. ID. NO. 5569), end-labeled using 
[32P]ATP and T4 polynucleotide kinase), 0.1 µg ACX return primer (5'- 
GCGCGG[CTTACC]3CTAACC-3') (SEQ. ID. NO. 5570), 0.1 µg NT internal control primer (5'- 
ATCGCTTCTCGGCCTTTT-3') (SEQ. ID. NO. 5571), 0.01 micromol TSNT internal control 
template (5'-AATCCGTCGAGCAGAGTTAAAAGGCCGAGAACGAT-3') (SEQ. ID. NO. 5572), 50 µM 
each deoxynucleoside triphosphate, 2 U of Taq DNA polymerase, and 2 µl CHAPS protein 
extract, all in 1X TRAP buffer (20 mM Tris (pH 8.3), 68 mM KCl, 1.5 mM MgCl2, 1 mM EGTA, 
0.05% Tween 20). Each reaction is placed in a thermocycler block preheated to 30 degrees C and 
incubated at 30 degrees C for 10 minutes, then cycled for 27 cycles of 94 degrees C for 30 
seconds and 60 degrees C for 30 seconds. Reaction products are separated on a denaturing 
8% polyacrylamide gel, followed by drying of the gel and autoradiography. The internal 
control (to control for possible Taq polymerase inhibition) generates a band of 36 nt. 
Comparison of radioactive signal integrated (e.g., by phosphorimager analysis) for 
telomerase-extended bands with the radioactive signal from a reaction performed with a 
known amount of quantification standard template (termed R8; 5'- 
AATCCGTCGAGCAGAGTTAG[GGTTAG]7-3') (SEQ. ID. NO. 5573) allows expression of 
telomerase activity as an absolute value. The absolute value = TPG (total product 
generated) = [(TP-TPi)/TI]/[(R8-B)/RI] x 100, where TP = telomerase products from test 
extract, TPi = telomerase products from a heat-inactivated (75°C, 10 minutes) extract 
reaction, TI = the signal from the internal control, R8 = the signal from the R8 quantification 
standard template reaction, B = signal from a lysis buffer-only blank reaction, and RI = the 
internal control value for the reaction containing R8 template and NT and TSNT control 
primers. TPG values of 0-10,000 are possible, with the linear range being from 
approximately 1 to 1000 TPG. The range of 1 to 1000 TPG encompasses the minimum and 
maximum levels of telomerase activity in most tumor samples tested, while non-tumor cells 
most often have no telomerase activity (TPG approximately zero). 
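
The TPG calculation described above can be sketched as follows. This is a minimal illustration, assuming the six band signals have already been integrated into plain numbers; the function names, argument names, and example signal values are hypothetical and chosen only to mirror the symbols TP, TPi, TI, R8, B and RI in the text.

```python
def tpg(tp, tp_i, ti, r8, b, ri):
    """Total product generated:
    TPG = [(TP - TPi) / TI] / [(R8 - B) / RI] x 100."""
    if ti == 0 or ri == 0:
        raise ValueError("internal control signals must be non-zero")
    standard = (r8 - b) / ri  # normalized signal of the R8 standard reaction
    if standard == 0:
        raise ValueError("R8 standard signal equals the blank")
    sample = (tp - tp_i) / ti  # normalized, background-corrected sample signal
    return sample / standard * 100.0


def in_linear_range(value, low=1.0, high=1000.0):
    """Check a TPG value against the approximate linear range of 1 to 1000."""
    return low <= value <= high


# Hypothetical integrated signals (arbitrary units).
value = tpg(tp=5200, tp_i=200, ti=1000, r8=10500, b=500, ri=2000)
print(value, in_linear_range(value))
```

Values outside the linear range would normally call for re-assaying a diluted or concentrated extract rather than trusting the computed number.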

[0034] An alternative telomerase assay, which does not employ PCR amplification, is 
described by Raymond et al. 1999, Br. J. Cancer 80: 1332-1341. 

[0035] By "enzymatic nucleic acid molecule" it is meant an RNA molecule which has 
complementarity in a substrate binding region to a specified gene target, and also has an 
enzymatic activity which is active to specifically cleave target RNA. That is, the enzymatic 
RNA molecule is able to intermolecularly cleave RNA and thereby inactivate a target RNA 
molecule. These complementary regions allow sufficient hybridization of the enzymatic RNA 
molecule to the target RNA and thus permit cleavage. One hundred percent 
complementarity between RNA and the target gene or target RNA is preferred, but 
complementarity as low as 50-75% may also be useful in this invention. The nucleic acids 
may be modified at the base, sugar, and/or phosphate groups. The term enzymatic nucleic 
acid is used interchangeably with phrases such as ribozymes, catalytic RNA, enzymatic RNA, 
catalytic DNA, aptazyme or aptamer-binding ribozyme, regulatable ribozyme, catalytic 
oligonucleotides, nucleozyme, DNAzyme, RNA enzyme, endoribonuclease, endonuclease, 
minizyme, leadzyme, oligozyme or DNA enzyme. All of these terminologies describe nucleic 
acid molecules with enzymatic activity. The specific enzymatic nucleic acid molecules 
described in the instant application are not meant to be limiting and those skilled in the art 
will recognize that all that is important in an enzymatic nucleic acid molecule of this invention 
is that it have a specific substrate binding site which is complementary to one or more of the 
target nucleic acid regions, and that it have nucleotide sequences within or surrounding that 
substrate binding site which impart a nucleic acid cleaving activity to the molecule (Cech et 
al., U.S. Patent No. 4,987,071; Cech et al., 1988, JAMA). 

[0036] By "enzymatic portion" or "catalytic domain" is meant that portion/region of the 
enzymatic nucleic acid molecule essential for cleavage of a nucleic acid substrate (for 
example see Figure 1). 

[0037] By "substrate binding arm" or "substrate binding domain" is meant that 
portion/region of a ribozyme which is complementary to (i.e., able to base-pair with) a 
portion of its substrate. Generally, such complementarity is 100%, but can be less if 
desired. For example, as few as 10 bases out of 14 may be base-paired. Such arms are 
shown generally in Figure 1. That is, these arms contain sequences within a ribozyme which 
are intended to bring ribozyme and target RNA together through complementary base-pairing 
interactions. The ribozyme of the invention may have binding arms that are contiguous or 
non-contiguous and may be of varying lengths. The length of the binding arm(s) is 
preferably greater than or equal to four nucleotides and of sufficient length to stably interact 
with the target RNA; specifically 12-100 nucleotides; more specifically 14-24 nucleotides 
long. If two binding arms are chosen, the design is such that the lengths of the binding arms 
are symmetrical (i.e., each of the binding arms is of the same length; e.g., five and five 
nucleotides, six and six nucleotides or seven and seven nucleotides long) or asymmetrical 
(i.e., the binding arms are of different length; e.g., six and three nucleotides; three and six 
nucleotides long; four and five nucleotides long; four and six nucleotides long; four and 
seven nucleotides long; and the like). 
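
The idea of symmetrical and asymmetrical binding arms can be illustrated with a simplified sketch that derives two arm sequences as reverse complements of the target regions flanking a chosen position. Everything here is an assumption made for illustration: the function names, the example target (an arbitrary RNA string, not a TERT sequence), and the simplification that ignores the catalytic core and any unpaired nucleotide at the cleavage site.

```python
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(rna):
    """Reverse complement of an RNA string written 5' to 3'."""
    return rna.translate(COMPLEMENT)[::-1]


def design_binding_arms(target, site, upstream_len, downstream_len):
    """Return (arm pairing with the upstream flank, arm pairing with the
    downstream flank), each written 5' to 3'."""
    if upstream_len > site or site + downstream_len > len(target):
        raise ValueError("arm lengths run past the ends of the target")
    upstream = target[site - upstream_len:site]
    downstream = target[site:site + downstream_len]
    return reverse_complement(upstream), reverse_complement(downstream)


def is_symmetric(arms):
    """Arms are symmetrical when both have the same length."""
    return len(arms[0]) == len(arms[1])


target = "GGAUCCGUCAAGCUUGCA"  # arbitrary illustrative sequence
seven_seven = design_binding_arms(target, 9, 7, 7)
six_three = design_binding_arms(target, 9, 6, 3)
print(seven_seven, is_symmetric(seven_seven))
print(six_three, is_symmetric(six_three))
```

A real design would also weigh target accessibility and the motif-specific sequence requirements at the cleavage site, which this sketch does not model.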

[0038] By "DNAzyme" is meant an enzymatic nucleic acid molecule lacking a 2'-OH group. 
In particular embodiments the enzymatic nucleic acid molecule may have an attached 
linker(s) or other attached or associated groups, moieties, or chains containing one or more 
nucleotides with 2'-OH groups. 

[0039] By "sufficient length" is meant an oligonucleotide of greater than or equal to 3 
nucleotides, 5 nucleotides, 7 nucleotides, 9 nucleotides or even 12 nucleotides. 

[0040] By "stably interact" is meant, interaction of the oligonucleotides with target nucleic 
acid (e.g., by forming hydrogen bonds with complementary nucleotides in the target under 
physiological conditions). 

[0041] By "equivalent" RNA to telomerase enzyme is meant to include those naturally 
occurring RNA molecules having homology (partial or complete) to nucleic acid sequences 
encoding telomerase proteins or encoding for proteins with similar function as telomerase in 
various organisms, including human, rodent, primate, rabbit, pig, protozoans, fungi, plants, 
and other microorganisms and parasites. The equivalent RNA sequence also includes in 
addition to the coding region, regions such as 5'-untranslated region, 3'-untranslated region, 
introns, intron-exon junction and the like. 

[0042] By "homology" is meant the nucleotide sequence of two or more nucleic acid 
molecules is partially or completely identical. 

[0043] By "antisense nucleic acid" it is meant a non-enzymatic nucleic acid molecule that 
binds to target RNA by means of RNA-RNA or RNA-DNA or RNA-PNA (peptide nucleic acid; 
Egholm et al., 1993 Nature 365, 566) interactions and alters the activity of the target RNA 
(for a review see Stein and Cheng, 1993 Science 261, 1004). Typically, antisense 
molecules will be complementary to a target sequence along a single contiguous sequence 
of the antisense molecule. However, in certain embodiments, an antisense molecule may 
bind to substrate such that the substrate molecule forms a loop, and/or an antisense 
molecule may bind such that the antisense molecule forms a loop. Thus, the antisense 
molecule may be complementary to two (or even more) non-contiguous substrate sequences 
or two (or even more) non-contiguous sequence portions of an antisense molecule may be 
complementary to a target sequence or both. 

[0044] By "2-5A antisense chimera" it is meant an antisense oligonucleotide containing 
5'-phosphorylated 2'-5'-linked adenylate residues. These chimeras bind to target RNA in a 
sequence-specific manner and activate a cellular 2-5A-dependent ribonuclease which, in turn, 
cleaves the target RNA (Torrence et al., 1993 Proc. Natl. Acad. Sci. USA 90, 1300). 

[0045] By "triplex DNA" it is meant an oligonucleotide that can bind to a double-stranded 
DNA in a sequence-specific manner to form a triple-strand helix. Formation of such triple 
helix structure has been shown to inhibit transcription of the targeted gene (Duval-Valentin et 
al., 1992 Proc. Natl. Acad. Sci. USA 89, 504). 

[0046] By "gene" it is meant a nucleic acid that encodes an RNA. 

[0047] By "complementarity" is meant that a nucleic acid can form hydrogen bond(s) with 
another RNA sequence by either traditional Watson-Crick or other non-traditional types. In 
reference to the nucleic acid molecules of the present invention, the binding free energy for a 
nucleic acid molecule with its target or complementary sequence is sufficient to allow the 
relevant function of the nucleic acid to proceed, e.g., ribozyme cleavage, antisense or triple 
helix inhibition. Determination of binding free energies for nucleic acid molecules is well 
known in the art (see, e.g., Turner et al., 1987, CSH Symp. Quant. Biol. LII pp. 123-133; 
Freier et al., 1986, Proc. Nat. Acad. Sci. USA 83:9373-9377; Turner et al., 1987, J. Am. 
Chem. Soc. 109:3783-3785). A percent complementarity indicates the percentage of 
contiguous residues in a nucleic acid molecule which can form hydrogen bonds (e.g., 
Watson-Crick base pairing) with a second nucleic acid sequence (e.g., 5, 6, 7, 8, 9, 10 out 
of 10 being 50%, 60%, 70%, 80%, 90%, and 100% complementary). "Perfectly 
complementary" means that all the contiguous residues of a nucleic acid sequence will 
hydrogen bond with the same number of contiguous residues in a second nucleic acid 
sequence. 
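
As a rough illustration of the percent complementarity definition above, the sketch below counts Watson-Crick pairs between an oligonucleotide and an equal-length target region aligned antiparallel. It is a simplification under stated assumptions: the function name and example sequences are hypothetical, only Watson-Crick pairs are counted (no G-U wobble or other non-traditional pairing), and no binding free energy is computed.

```python
WC_PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("A", "T"), ("T", "A")}


def percent_complementarity(oligo, target):
    """Percentage of oligo residues that form a Watson-Crick pair with the
    target when the two equal-length strands are aligned antiparallel."""
    if not oligo or len(oligo) != len(target):
        raise ValueError("sequences must be non-empty and of equal length")
    paired = sum(
        (a, b) in WC_PAIRS
        for a, b in zip(oligo.upper(), reversed(target.upper()))
    )
    return 100.0 * paired / len(oligo)


# Arbitrary illustrative sequences, both written 5' to 3'.
print(percent_complementarity("GACGGAUCAA", "UUGAUCCGUC"))  # 10 of 10 paired
print(percent_complementarity("GACGGAUCAA", "AAGAUCCGUC"))  # 8 of 10 paired
```

With this counting, 8 paired residues out of 10 gives 80%, matching the scale given in the definition.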

[0048] At least seven basic varieties of naturally-occurring enzymatic RNAs are known 
presently. Each can catalyze the hydrolysis of RNA phosphodiester bonds in trans (and thus 
can cleave other RNA molecules) under physiological conditions. Table I summarizes some 
of the characteristics of these ribozymes. In general, enzymatic nucleic acids act by first 
binding to a target RNA. Such binding occurs through the target binding portion of an 
enzymatic nucleic acid which is held in close proximity to an enzymatic portion of the 
molecule that acts to cleave the target RNA. Thus, the enzymatic nucleic acid first 
recognizes and then binds a target RNA through complementary base-pairing, and once 
bound to the correct site, acts enzymatically to cut the target RNA. Strategic cleavage of 
such a target RNA will destroy its ability to direct synthesis of an encoded protein. After an 
enzymatic nucleic acid has bound and cleaved its RNA target, it is released from that RNA to 
search for another target and can repeatedly bind and cleave new targets. Thus, a single 
ribozyme molecule is able to cleave many molecules of target RNA. In addition, the 
ribozyme is a highly specific inhibitor of gene expression, with the specificity of inhibition 
depending not only on the base-pairing mechanism of binding to the target RNA, but also on 
the mechanism of target RNA cleavage. Single mismatches, or base-substitutions, near the 
site of cleavage can completely eliminate catalytic activity of a ribozyme. 

[0049] The enzymatic nucleic acid molecules that cleave the specified sites in telomerase- 
specific RNAs represent a novel therapeutic approach to treat a variety of pathologic 
indications, including cancer, tumorigenesis, restenosis and others. 

[0050] In one of the preferred embodiments of the inventions described herein, the 
enzymatic nucleic acid molecule is formed in a hammerhead or hairpin motif, but may also 
be formed in the motif of a hepatitis delta virus, group I intron, group II intron or RNase P
RNA (in association with an RNA guide sequence), Neurospora VS RNA, DNAzymes, NCH
cleaving motifs, or G-cleavers. Examples of such hammerhead motifs are described by
Dreyfus, supra, Rossi et al., 1992, AIDS Research and Human Retroviruses 8, 183; of
hairpin motifs by Hampel et al., EP0360257, Hampel and Tritz, 1989 Biochemistry 28,
4929, Feldstein et al., 1989, Gene 82, 53, Haseloff and Gerlach, 1989, Gene, 82, 43, and
Hampel et al., 1990 Nucleic Acids Res. 18, 299; Chowrira & McSwiggen, U.S. Patent No.
5,631,359; of the hepatitis delta virus motif by Perrotta and Been, 1992
Biochemistry 31, 16; of the RNase P motif by Guerrier-Takada et al., 1983 Cell 35, 849;
Forster and Altman, 1990, Science 249, 783; Li and Altman, 1996, Nucleic Acids Res. 24,
835; of the Neurospora VS RNA ribozyme motif by Collins (Saville and Collins, 1990
Cell 61, 685-696; Saville and Collins, 1991 Proc. Natl. Acad. Sci. USA 88, 8826-8830;
Collins and Olive, 1993 Biochemistry 32, 2795-2799; Guo and Collins, 1995, EMBO. J. 14,
363); of Group II introns by Griffin et al., 1995, Chem. Biol. 2, 761; Michels and
Pyle, 1995, Biochemistry 34, 2965; Pyle et al., International PCT Publication No. WO
96/22689; of the Group I intron by Cech et al., U.S. Patent 4,987,071 and of DNAzymes by
Usman et al., International PCT Publication No. WO 95/11304; Chartrand et al., 1995, NAR
23, 4092; Breaker et al., 1995, Chem. Bio. 2, 655; Santoro et al., 1997, PNAS 94, 4262.
NCH cleaving motifs are described in Ludwig & Sproat, International PCT Publication No. WO
98/58058; and G-cleavers are described in Kore et al., 1998, Nucleic Acids Research 26,
4116-4120 and Eckstein et al., International PCT Publication No. WO 99/16871. Additional
motifs such as the Aptazyme (Breaker et al., WO 98/43993), Amberzyme (Class I motif;
Figure 3; Beigelman et al., U.S. Serial No. 09/301,511) and Zinzyme (Beigelman et al., U.S.
Serial No. 09/301,511) can also be used in the present invention. These specific motifs are
not limiting in the invention and those skilled in the art will recognize that all that is important
in an enzymatic nucleic acid molecule of this invention is that it has a specific substrate
binding site which is complementary to one or more of the target gene RNA regions, and
that it has nucleotide sequences within or surrounding that substrate binding site which
impart an RNA cleaving activity to the molecule (Cech et al., U.S. Patent No. 4,987,071).

[0051] In preferred embodiments of the present invention, a nucleic acid molecule, e.g., 
an antisense molecule, a triplex DNA, or a ribozyme, is 13 to 100 nucleotides in length, e.g., 
in specific embodiments 35, 36, 37, or 38 nucleotides in length (e.g., for particular
ribozymes or antisense). In particular embodiments, the nucleic acid molecule is 15-100, 
17-100, 20-100, 21-100, 23-100, 25-100, 27-100, 30-100, 32-100, 35-100, 40-100, 50- 
100, 60-100, 70-100, or 80-100 nucleotides in length. Instead of 100 nucleotides being 
the upper limit on the length ranges specified above, the upper limit of the length range can 
be, for example, 30, 40, 50, 60, 70, or 80 nucleotides. Thus, for any of the length ranges, 
the length range for particular embodiments has a lower limit as specified, with an upper limit
as specified which is greater than the lower limit. For example, in a particular embodiment, 
the length range can be 35-50 nucleotides in length. All such ranges are expressly included. 
Also in particular embodiments, a nucleic acid molecule can have a length which is any of the 
lengths specified above, for example, 21 nucleotides in length. 
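
The way the lower and upper limits combine into length ranges can be sketched as a simple enumeration. This is an illustrative sketch only; the two lists are copied from the limits recited above, and the function names length_ranges and in_any_range are hypothetical.

```python
# Lower and upper limits (in nucleotides) as recited in the paragraph above.
LOWER_LIMITS = [13, 15, 17, 20, 21, 23, 25, 27, 30, 32, 35, 40, 50, 60, 70, 80]
UPPER_LIMITS = [30, 40, 50, 60, 70, 80, 100]


def length_ranges():
    """Return every (lower, upper) pair in which the upper limit exceeds the lower."""
    return [(lo, hi) for lo in LOWER_LIMITS for hi in UPPER_LIMITS if hi > lo]


def in_any_range(length: int) -> bool:
    """Return True if `length` falls inside at least one enumerated range."""
    return any(lo <= length <= hi for lo, hi in length_ranges())
```

For example, the pair (35, 50) appears in the enumeration, corresponding to the 35-50 nucleotide range given above, while a pair such as (50, 30) is excluded because its upper limit does not exceed its lower limit.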

[0052] In a preferred embodiment the invention provides a method for producing a class 
of nucleic acid-based gene inhibiting agents which exhibit a high degree of specificity for
the RNA of a desired target. For example, the enzymatic nucleic acid molecule is preferably 
targeted to a highly conserved sequence region of target RNAs encoding telomerase 
proteins (specifically TERT gene) such that specific treatment of a disease or condition can 
be provided with either one or several nucleic acid molecules of the invention. Such nucleic 
acid molecules can be delivered exogenously to specific tissue or cellular targets as 
required. Alternatively, the nucleic acid molecules (e.g., ribozymes and antisense) can be
expressed from DNA and/or RNA vectors that are delivered to specific cells. 

[0053] By "highly conserved sequence region" is meant that a nucleotide sequence of one or
more regions in a target gene does not vary significantly from one generation to the other or 
from one biological system to the other. 

[0054] The nucleic acid-based inhibitors of telomerase expression are useful for the
prevention of diseases and conditions including cancer, macular degeneration,
restenosis, certain infectious diseases, transplant rejection and autoimmune diseases such as
multiple sclerosis, lupus, and AIDS; age-related diseases such as macular degeneration, skin
ulceration, and rheumatoid arthritis; and any other diseases or conditions that are related to
the levels of telomerase in a cell or tissue.

[0055] By "related" is meant that the reduction of telomerase expression (specifically 
TERT gene) RNA levels and thus reduction in the level of the respective protein will relieve, to 
some extent, the symptoms of the disease or condition. 

[0056] The nucleic acid-based inhibitors of the invention are added directly, or can be 
complexed with cationic lipids, packaged within liposomes, or otherwise delivered to target 
cells or tissues. The nucleic acid or nucleic acid complexes can be locally administered to 
relevant tissues ex vivo, or in vivo through injection, infusion pump or stent, with or without 
their incorporation in biopolymers. In preferred embodiments, the enzymatic nucleic acid 
inhibitors comprise sequences which are complementary to the substrate sequences in 
Tables III-VII. Examples of such enzymatic nucleic acid molecules also are shown in
Tables III to VII. Examples of such enzymatic nucleic acid molecules consist essentially of 
sequences defined in these Tables. 

[0057] In yet another embodiment, the invention features antisense nucleic acid 
molecules and 2-5A chimera including sequences complementary to the substrate 
sequences shown in Tables III to VII. Such nucleic acid molecules can include sequences
as shown for the binding arms of the enzymatic nucleic acid molecules in Tables III to VII. 
Similarly, triplex molecules can be provided targeted to the corresponding DNA target 
regions, and containing the DNA equivalent of a target sequence or a sequence
complementary to the specified target (substrate) sequence. Typically, antisense molecules 
will be complementary to a target sequence along a single contiguous sequence of the 
antisense molecule. However, in certain embodiments, an antisense molecule may bind to 
substrate such that the substrate molecule forms a loop, and/or an antisense molecule may 
bind such that the antisense molecule forms a loop. Thus, the antisense molecule may be 
complementary to two (or even more) non-contiguous substrate sequences or two (or even 
more) non-contiguous sequence portions of an antisense molecule may be complementary to 
a target sequence or both. 

[0058] By "consists essentially of" is meant that the active ribozyme contains an 
enzymatic center or core equivalent to those in the examples, and binding arms able to bind 
mRNA such that cleavage at the target site occurs. Other sequences may be present which 
do not interfere with such cleavage. Thus, a core region may, for example, include one or 
more loop or stem-loop structures, which do not prevent enzymatic activity. The underlined
regions in the sequences in Tables III and IV can be such a loop, and can be represented
generally as sequence "X". For example, a core sequence for a hammerhead ribozyme can
be a 5'-CUGAUGAG-3' and 5'-CGAA-3' connected by "X", where X is 5'-GCCGUUAGGC-3' (SEQ
ID NO 5574), or any other Stem II region known in the art.
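
The core arrangement described above can be sketched as a string assembly. This is a minimal illustration, not a design tool: the core segments and the example X sequence are taken from the paragraph above, while the placement of one binding arm on each side of the core, the function name assemble_hammerhead, and the arm sequences used in any call are assumptions made for illustration only.

```python
CORE_5P = "CUGAUGAG"            # 5' core segment recited above
CORE_3P = "CGAA"                # 3' core segment recited above
DEFAULT_STEM_II = "GCCGUUAGGC"  # example "X" sequence recited above


def assemble_hammerhead(arm_5p: str, arm_3p: str,
                        stem_ii: str = DEFAULT_STEM_II) -> str:
    """Concatenate binding arms, core segments and Stem II region, 5'->3'.

    Assumed layout (hypothetical): 5'-arm / CUGAUGAG / X / CGAA / 3'-arm.
    """
    for name, seq in (("arm_5p", arm_5p), ("arm_3p", arm_3p),
                      ("stem_ii", stem_ii)):
        if not seq or set(seq) - set("ACGU"):
            raise ValueError(f"{name} must be a non-empty RNA sequence (A, C, G, U)")
    return arm_5p + CORE_5P + stem_ii + CORE_3P + arm_3p
```

Any other Stem II region can be passed as stem_ii in place of the default; the binding arms themselves would come from the target site sequences in the Tables.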

[0059] In another aspect of the invention, ribozymes or antisense molecules that cleave 
target RNA molecules and inhibit telomerase enzyme (specifically TERT) activity are 
expressed from transcription units inserted into DNA or RNA vectors. The recombinant 
vectors are preferably DNA plasmids or viral vectors. Ribozyme or antisense expressing viral 
vectors could be constructed based on, but not limited to, adeno-associated virus, 
retrovirus, adenovirus, or alphavirus. Preferably, the recombinant vectors capable of 
expressing the ribozymes or antisense are delivered as described above, and persist in 
target cells. Alternatively, viral vectors may be used that provide for transient expression of 
ribozymes or antisense. Such vectors might be repeatedly administered as necessary. 
Once expressed, the ribozymes or antisense bind to the target RNA and inhibit its function or
expression. Delivery of ribozyme or antisense expressing vectors could be systemic, such
as by intravenous or intramuscular administration, by administration to target cells explanted
from the patient followed by reintroduction into the patient, or by any other means that would 
allow for introduction into the desired target cell. 

[0060] By "vectors" is meant any nucleic acid- and/or viral-based technique used to 
deliver a desired nucleic acid. 

[0061] By "patient" is meant an organism which is a donor or recipient of explanted cells 
or the cells themselves. "Patient" also refers to an organism to which the nucleic acid 
molecules of the invention can be administered. Preferably, a patient is a mammal or 
mammalian cells. More preferably, a patient is a human or human cells. 

[0062] The nucleic acid molecules of the instant invention, individually, or in combination 
or in conjunction with other drugs, can be used to treat diseases or conditions discussed 
above. For example, to treat a disease or condition associated with the levels of telomerase 
enzyme, the patient may be treated, or other appropriate cells may be treated, as is evident 
to those skilled in the art, individually or in combination with one or more drugs under 
conditions suitable for the treatment. 

[0063] In a further embodiment, the described molecules, such as antisense or ribozymes,
can be used in combination with other known treatments to treat conditions or diseases 
discussed above. For example, the described molecules could be used in combination with 
one or more known therapeutic agents to treat cancer. 

[0064] In another preferred embodiment, the invention features nucleic acid-based 
inhibitors (e.g., enzymatic nucleic acid molecules (ribozymes), antisense nucleic acids, 2-5A 
antisense chimeras, triplex DNA, antisense nucleic acids containing RNA cleaving chemical 
groups) and methods for their use to down regulate or inhibit the expression of genes (e.g., 
TERT) capable of progression and/or maintenance of cancer. 

[0065] In another preferred embodiment, the invention features nucleic acid-based 
techniques (e.g., enzymatic nucleic acid molecules (ribozymes), antisense nucleic acids, 2- 
5A antisense chimeras, triplex DNA, antisense nucleic acids containing RNA cleaving 
chemical groups) and methods for their use to down regulate or inhibit
TERT gene expression.

[0066] By "comprising" is meant including, but not limited to, whatever follows the word 
"comprising". Thus, use of the term "comprising" indicates that the listed elements are 
required or mandatory, but that other elements are optional and may or may not be present. 
By "consisting of" is meant including, and limited to, whatever follows the phrase "consisting
of". Thus, the phrase "consisting of" indicates that the listed elements are required or
mandatory, and that no other elements may be present. By "consisting essentially of" is
meant including any elements listed after the phrase, and limited to other elements that do 
not interfere with or contribute to the activity or action specified in the disclosure for the 
listed elements. Thus, the phrase "consisting essentially of" indicates that the listed 
elements are required or mandatory, but that other elements are optional and may or may 
not be present depending upon whether or not they affect the activity or action of the listed 
elements. 

[0067] Other features and advantages of the invention will be apparent from the following 
description of the preferred embodiments thereof, and from the claims. 

Brief Description of the Drawings

[0068] Figure 1 shows the secondary structure model for seven different classes of
enzymatic nucleic acid molecules. Arrow indicates the site of cleavage. indicate the
target sequence. Lines interspersed with dots are meant to indicate tertiary interactions. - is
meant to indicate base-paired interaction. Group I Intron: P1-P9.0 represent various stem-
loop structures (Cech et al., 1994, Nature Struc. Bio., 1, 273). RNase P (M1RNA): EGS
represents external guide sequence (Forster et al., 1990, Science, 249, 783; Pace et al.,
1990, J. Biol. Chem., 265, 3587). Group II Intron: 5'SS means 5' splice site; 3'SS means
3'-splice site; IBS means intron binding site; EBS means exon binding site (Pyle et al., 1994,
Biochemistry, 33, 2716). VS RNA: I-VI are meant to indicate six stem-loop structures;
shaded regions are meant to indicate tertiary interaction (Collins, International PCT
Publication No. WO 96/19577). HDV Ribozyme: I-IV are meant to indicate four stem-loop
structures (Been et al., US Patent No. 5,625,047). Hammerhead Ribozyme: I-III are
meant to indicate three stem-loop structures; stems I-III can be of any length and may be
symmetrical or asymmetrical (Usman et al., 1996, Curr. Op. Struct. Bio., 1, 527). Hairpin
Ribozyme: Helix 1, 4 and 5 can be of any length; Helix 2 is between 3 and 8 base-pairs
long; Y is a pyrimidine; Helix 2 (H2) is provided with at least 4 base pairs (i.e., n is 1, 2, 3 or
4) and helix 5 can be optionally provided of length 2 or more bases (preferably 3 - 20 bases,
i.e., m is from 1 - 20 or more). Helix 2 and helix 5 may be covalently linked by one or more
bases (i.e., r is ≥ 1 base). Helix 1, 4 or 5 may also be extended by 2 or more base pairs
(e.g., 4 - 20 base pairs) to stabilize the ribozyme structure, and preferably is a protein
binding site. In each instance, each N and N' independently is any normal or modified base
and each dash represents a potential base-pairing interaction. These nucleotides may be
modified at the sugar, base or phosphate. Complete base-pairing is not required in the
helices, but is preferred. Helix 1 and 4 can be of any size (i.e., o and p is each
independently from 0 to any number, e.g., 20) as long as some base-pairing is maintained.
Essential bases are shown as specific bases in the structure, but those in the art will
recognize that one or more may be modified chemically (abasic, base, sugar and/or
phosphate modifications) or replaced with another base without significant effect. Helix 4
can be formed from two separate molecules, i.e., without a connecting loop. The
connecting loop when present may be a ribonucleotide with or without modifications to its
base, sugar or phosphate. "q" is ≥ 2 bases. The connecting loop can also be replaced
with a non-nucleotide linker molecule. H refers to bases A, U, or C. Y refers to pyrimidine
bases. " " refers to a covalent bond. (Burke et al., 1996, Nucleic Acids & Mol. Biol.,
10, 129; Chowrira et al., US Patent No. 5,631,359).

[0069] Figure 2 shows examples of chemically stabilized ribozyme motifs. HH Rz
represents hammerhead ribozyme motif (Usman et al., 1996, Curr. Op. Struct. Bio., 1, 527);
NCH Rz represents the NCH ribozyme motif (Ludwig & Sproat, International PCT Publication
No. WO 98/58058); G-Cleaver represents G-cleaver ribozyme motif (Kore et al., 1998,
Nucleic Acids Research 26, 4116-4120). N or n represent independently a nucleotide
which may be same or different and have complementarity to each other; rI represents
ribo-inosine nucleotide; arrow indicates the site of cleavage within the target. Position 4 of the HH
Rz and the NCH Rz is shown as having 2'-C-allyl modification, but those skilled in the art will 
recognize that this position can be modified with other modifications well known in the art, so 
long as such modifications do not significantly inhibit the activity of the ribozyme. 

[0070] Figure 3 shows an example of the Amberzyme ribozyme motif that is chemically 
stabilized (see for example Beigelman et al., WO 99/55857; also referred to as Class I
Motif). 

[0071] Figure 4 shows an example of the Zinzyme A ribozyme motif that is chemically 
stabilized (see for example Beigelman et al., WO 99/55857; also referred to as Class A
Motif). 

Detailed Description of the Invention 
Mechanism of action of Nucleic Acid Molecules of the Invention 

[0072] Antisense : Antisense molecules may be modified or unmodified RNA, DNA, or 
mixed polymer oligonucleotides and primarily function by specifically binding to matching 
sequences resulting in inhibition of peptide synthesis (Wu-Pong, Nov 1994, BioPharm, 20-
33). The antisense oligonucleotide binds to target RNA by Watson Crick base-pairing and 
blocks gene expression by preventing ribosomal translation of the bound sequences either 
by steric blocking or by activating RNase H enzyme. Antisense molecules may also alter 
protein synthesis by interfering with RNA processing or transport from the nucleus into the 
cytoplasm (Mukhopadhyay & Roth, 1996, Crit Rev. in Oncogenesis 7, 151-190). 

[0073] In addition, binding of single stranded DNA to RNA may result in nuclease 
degradation of the heteroduplex (Wu-Pong, supra; Crooke, supra). To date, the only 
backbone modified DNA chemistries which act as substrates for RNase H are
phosphorothioates and phosphorodithioates. Recently it has been reported that 2'-arabino
and 2'-fluoro arabino-containing oligos can also activate RNase H activity.

[0074] A number of antisense molecules have been described that utilize novel 
configurations of chemically modified nucleotides, secondary structure, and/or RNase H 
substrate domains (Woolf et al., International PCT Publication No. WO 98/13526; Thompson 
et al., USSN 60/082,404 which was filed on April 20, 1998; Hartmann et al., USSN 
60/101,174 which was filed on September 21, 1998); all of these are incorporated by
reference herein in their entirety. 

[0075] Triplex Forming Oligonucleotides (TFO) : Single stranded DNA may be designed to 
bind to genomic DNA in a sequence specific manner. TFOs are comprised of pyrimidine-rich 
oligonucleotides which bind DNA helices through Hoogsteen Base-pairing (Wu-Pong, supra). 
The resulting triple helix composed of the DNA sense, DNA antisense, and TFO disrupts RNA 
synthesis by RNA polymerase. The TFO mechanism may result in gene expression or cell 
death since binding may be irreversible (Mukhopadhyay & Roth, supra).

[0076] 2-5A Antisense Chimera : The 2-5A system is an interferon mediated mechanism 
for RNA degradation found in higher vertebrates (Mitra et al., 1996, Proc Nat Acad Sci USA
93, 6780-6785). Two types of enzymes, 2-5A synthetase and RNase L, are required for
RNA cleavage. The 2-5A synthetases require double stranded RNA to form 2'-5'
oligoadenylates (2-5A). 2-5A then acts as an allosteric effector for utilizing RNase L which 
has the ability to cleave single stranded RNA. The ability to form 2-5A structures with double 
stranded RNA makes this system particularly useful for inhibition of viral replication. 

[0077] (2'-5') oligoadenylate structures may be covalently linked to antisense molecules 
to form chimeric oligonucleotides capable of RNA cleavage (Torrence, supra). These 
molecules putatively bind and activate a 2-5A dependent RNase; the oligonucleotide/enzyme
complex then binds to a target RNA molecule which can then be cleaved by the RNase 
enzyme. 

[0078] Enzymatic Nucleic Acid : Seven basic varieties of naturally-occurring enzymatic 
RNAs are presently known. In addition, several in vitro selection (evolution) strategies (Orgel, 
1979, Proc. R. Soc. London, B 205, 435) have been used to evolve new nucleic acid
catalysts capable of catalyzing cleavage and ligation of phosphodiester linkages (Joyce,
1989, Gene, 82, 83-87; Beaudry et al., 1992, Science 257, 635-641; Joyce, 1992,
Scientific American 267, 90-97; Breaker et al., 1994, TIBTECH 12, 268; Bartel et al., 1993,
Science 261:1411-1418; Szostak, 1993, TIBS 17, 89-93; Kumar et al., 1995, FASEB J., 9,
1183; Breaker, 1996, Curr. Op. Biotech., 7, 442; Santoro et al., 1997, Proc. Natl. Acad.
Sci., 94, 4262; Tang et al., 1997, RNA 3, 914; Nakamaye & Eckstein, 1994, supra; Long &
Uhlenbeck, 1994, supra; Ishizaka et al., 1995, supra; Vaish et al., 1997, Biochemistry 36,
6495; all of these are incorporated by reference herein). Each can catalyze a series of 
reactions including the hydrolysis of phosphodiester bonds in trans (and thus can cleave 
other RNA molecules) under physiological conditions. 

[0079] Nucleic acid molecules of this invention will block to some extent telomerase 
protein expression (specifically TERT) and can be used to treat disease or diagnose disease 
associated with the levels of telomerase enzyme. 

[0080] The enzymatic nature of a ribozyme has significant advantages, such as a lower
concentration of ribozyme necessary to effect a therapeutic treatment. This
advantage reflects the ability of the ribozyme to act enzymatically. Thus, a single ribozyme 
molecule is able to cleave many molecules of target RNA. In addition, the ribozyme is a 
highly specific inhibitor, with the specificity of inhibition depending not only on the base- 
pairing mechanism of binding to the target RNA, but also on the mechanism of target RNA 
cleavage. Single mismatches, or base-substitutions, near the site of cleavage can be 
chosen to completely eliminate catalytic activity of a ribozyme. 

[0081] Nucleic acid molecules having an endonuclease enzymatic activity are able to 
repeatedly cleave other separate RNA molecules in a nucleotide base sequence-specific 
manner. Such enzymatic nucleic acid molecules can be targeted to virtually any RNA 
transcript, and achieve efficient cleavage in vitro (Zaug et al., 324, Nature 429 1986;
Uhlenbeck, 1987 Nature 328, 596; Kim et al., 84 Proc. Natl. Acad. Sci. USA 8788, 1987; 
Dreyfus, 1988, Einstein Quart. J. Bio. Med., 6, 92; Haseloff and Gerlach, 334 Nature 585, 
1988; Cech, 260 JAMA 3030, 1988; and Jefferies et al., 17 Nucleic Acids Research 1371, 
1989; Santoro et al., 1997 supra).

[0082] Because of their sequence specificity, trans-cleaving ribozymes show promise as 
therapeutic agents for human disease (Usman & McSwiggen, 1995 Ann. Rep. Med. Chem. 
30, 285-294; Christoffersen and Marr, 1995 J. Med. Chem. 38, 2023-2037). Ribozymes 
can be designed to cleave specific RNA targets within the background of cellular RNA. Such 
a cleavage event renders the RNA non-functional and abrogates protein expression from that 
RNA. In this manner, synthesis of a protein associated with a disease state can be 
selectively inhibited. 

Target sites 

[0083] Targets for useful ribozymes and antisense nucleic acids can be determined as 
disclosed in Draper et al., WO 93/23569; Sullivan et al., WO 93/23057; Thompson et al., 
WO 94/02595; Draper et al., WO 95/04818; McSwiggen et al., US Patent No. 5,525,468,
all of which are hereby incorporated by reference herein in totality. Other examples include the following
PCT applications which concern inactivation of expression of disease-related genes: WO 
95/23225, WO 95/13380, WO 94/02595, incorporated by reference herein. Rather than 
repeat the guidance provided in those documents here, below are provided specific 
examples of such methods, not limiting to those in the art. Ribozymes and antisense to such 
targets are designed as described in those applications and synthesized to be tested in vitro 
and in vivo, as also described. The sequences of human TERT RNAs were screened for
optimal enzymatic nucleic acid and antisense target sites using a computer folding 
algorithm. Antisense, hammerhead, DNAzyme, NCH, or G-Cleaver ribozyme 
binding/cleavage sites were identified. These sites are shown in Tables III to VII (all 
sequences are 5' to 3' in the tables; the underlined region can be any base-paired sequence, 
the actual sequence is not relevant here). The nucleotide base position is noted in the 
Tables as that site to be cleaved by the designated type of enzymatic nucleic acid molecule. 
While human sequences can be screened and enzymatic nucleic acid molecule and/or 
antisense thereafter designed, as discussed in Stinchcomb et al., WO 95/23225, mouse
targeted ribozymes may be useful to test efficacy of action of the enzymatic nucleic acid 
molecule and/or antisense prior to testing in humans. 
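
As a rough illustration of the first step of such a screen, candidate hammerhead cleavage sites can be located by scanning the target RNA for a U followed by H. This sketch rests on assumptions not stated in this section: it uses the conventional hammerhead rule of cleavage 3' of a UH dinucleotide (H being A, U, or C), reports the 1-based position of the H residue, and uses the hypothetical function name find_uh_sites. It performs no folding analysis, which the screen described above also requires.

```python
import re


def find_uh_sites(rna: str):
    """Return (position, dinucleotide) for every U followed by A, C or U.

    `position` is the 1-based index of the H residue, after which a
    hammerhead ribozyme is conventionally taken to cleave (assumption).
    DNA-style input is accepted: T is converted to U.
    """
    rna = rna.upper().replace("T", "U")
    sites = []
    # A lookahead is used so that overlapping sites (e.g. "UUA") are all found.
    for match in re.finditer(r"(?=U[ACU])", rna):
        start = match.start()              # 0-based index of the U
        sites.append((start + 2, rna[start:start + 2]))
    return sites
```

Each candidate site would then still need to be checked by folding analysis and tested for activity, as described above.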

[0084] Antisense, hammerhead, DNAzyme, NCH, or G-Cleaver ribozyme binding/cleavage 
sites were identified. The nucleic acid molecules were individually analyzed by computer 
folding (Jaeger et al., 1989 Proc. Natl. Acad. Sci. USA, 86, 7706) to assess whether the
sequences fold into the appropriate secondary structure. Those nucleic acid molecules with 
unfavorable intramolecular interactions such as between the binding arms and the catalytic 
core were eliminated from consideration. Varying binding arm lengths can be chosen to 
optimize activity. 

[0085] Antisense, hammerhead, DNAzyme, NCH, or G-Cleaver ribozyme binding/cleavage 
sites were identified and were designed to anneal to various sites in the RNA target. The 
binding arms are complementary to the target site sequences described above. The nucleic
acid molecules were chemically synthesized. The method of synthesis used follows the
procedure for normal DNA/RNA synthesis as described below and in Usman et al., 1987 J.
Am. Chem. Soc., 109, 7845; Scaringe et al., 1990 Nucleic Acids Res., 18, 5433; and
Wincott et al., 1995 Nucleic Acids Res. 23, 2677-2684; Caruthers et al., 1992, Methods in
Enzymology 211, 3-19.

Synthesis of Nucleic acid Molecules 

[0086] Synthesis of nucleic acids greater than 100 nucleotides in length is difficult using 
automated methods, and the therapeutic cost of such molecules is prohibitive. In this 
invention, small nucleic acid motifs ("small" refers to nucleic acid motifs no more than 100 
nucleotides in length, preferably no more than 80 nucleotides in length, and most preferably 
no more than 50 nucleotides in length; e.g., antisense oligonucleotides, hammerhead or the 
hairpin ribozymes) are preferably used for exogenous delivery. The simple structure of 
these molecules increases the ability of the nucleic acid to invade targeted regions of RNA 
structure. Exemplary molecules of the instant invention were chemically synthesized, and 
others can similarly be synthesized. Oligodeoxyribonucleotides were synthesized using 
standard protocols as described in Caruthers et al., 1992, Methods in Enzymology 211,
3-19, which is incorporated herein by reference.

[0087] The method of synthesis used for normal RNA including certain enzymatic nucleic 
acid molecules follows the procedure as described in Usman et al., 1987 J. Am. Chem.
Soc., 109, 7845; Scaringe et al., 1990 Nucleic Acids Res., 18, 5433; Wincott et al.,
1995 Nucleic Acids Res. 23, 2677-2684; and Wincott et al., 1997, Methods Mol. Bio., 74, 59,
and makes use of common nucleic acid protecting and coupling groups, such as 
dimethoxytrityl at the 5'-end, and phosphoramidites at the 3'-end. In a non-limiting example, 
small scale syntheses were conducted on a 394 Applied Biosystems, Inc. synthesizer using 
a 0.2 μmol scale protocol with a 7.75 min coupling step for alkylsilyl protected nucleotides
and a 2.5 min coupling step for 2'-O-methylated nucleotides. Table II outlines the amounts
and the contact times of the reagents used in the synthesis cycle. Alternatively, syntheses
at the 0.2 μmol scale can be done on a 96-well plate synthesizer, such as the instrument
produced by Protogene (Palo Alto, CA) with minimal modification to the cycle. A 15-fold
excess (31 μL of 0.1 M = 3.1 μmol) of phosphoramidite and a 38.7-fold excess of S-ethyl
tetrazole (31 μL of 0.25 M = 7.75 μmol) relative to polymer-bound 5'-hydroxyl was used in
each coupling cycle. Average coupling yields on the 394 Applied Biosystems, Inc.
synthesizer, determined by colorimetric quantitation of the trityl fractions, were 97.5-99%.
Other oligonucleotide synthesis reagents for the 394 Applied Biosystems, Inc. synthesizer:
detritylation solution was 3% TCA in methylene chloride (ABI); capping was performed with
16% N-methyl imidazole in THF (ABI) and 10% acetic anhydride/10% 2,6-lutidine in THF (ABI);
oxidation solution was 16.9 mM I2, 49 mM pyridine, 9% water in THF (PERSEPTIVE™).
Burdick & Jackson Synthesis Grade acetonitrile was used directly from the reagent bottle. 
S-Ethyltetrazole solution (0.25 M in acetonitrile) was made up from the solid obtained from 
American International Chemical, Inc. 
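
The reagent excesses quoted above follow from simple arithmetic, which can be checked with the sketch below. The function names micromoles and fold_excess are hypothetical; the volumes, concentrations and the 0.2 μmol synthesis scale are those given in the paragraph above.

```python
def micromoles(volume_ul: float, molarity: float) -> float:
    """Amount of reagent in micromoles: microliters x mol/L = micromoles."""
    return volume_ul * molarity


def fold_excess(volume_ul: float, molarity: float, scale_umol: float) -> float:
    """Molar excess of a reagent relative to the polymer-bound 5'-hydroxyl."""
    if scale_umol <= 0:
        raise ValueError("synthesis scale must be positive")
    return micromoles(volume_ul, molarity) / scale_umol


# Phosphoramidite: 31 uL of 0.1 M on a 0.2 umol scale
amidite_excess = fold_excess(31, 0.1, 0.2)
# S-ethyl tetrazole: 31 uL of 0.25 M on a 0.2 umol scale
tetrazole_excess = fold_excess(31, 0.25, 0.2)
```

On these inputs the phosphoramidite excess works out to about 15.5-fold and the S-ethyl tetrazole excess to about 38.75-fold, in line with the approximately 15-fold and 38.7-fold figures stated above.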

[0088] Deprotection of the RNA was performed using either a two-pot or one-pot 
protocol. For the two-pot protocol, the polymer-bound trityl-on oligoribonucleotide was 
transferred to a 4 mL glass screw top vial and suspended in a solution of 40% aq. 
methylamine (1 mL) at 65 °C for 10 min. After cooling to -20 °C, the supernatant was
removed from the polymer support. The support was washed three times with 1.0 mL of
EtOH:MeCN:H2O/3:1:1, vortexed and the supernatant was then added to the first
supernatant. The combined supernatants, containing the oligoribonucleotide, were dried to a
white powder. The base deprotected oligoribonucleotide was resuspended in anhydrous
TEA/HF/NMP solution (300 μL of a solution of 1.5 mL N-methylpyrrolidinone, 750 μL TEA
and 1 mL TEA•3HF to provide a 1.4 M HF concentration) and heated to 65 °C. After 1.5 h,
the oligomer was quenched with 1.5 M NH4HCO3. 

[0089] Alternatively, for the one-pot protocol, the polymer-bound trityl-on
oligoribonucleotide was transferred to a 4 mL glass screw top vial and suspended in a
solution of 33% ethanolic methylamine/DMSO: 1/1 (0.8 mL) at 65 °C for 15 min. The vial
was brought to r.t. TEA•3HF (0.1 mL) was added and the vial was heated at 65 °C for 15
min. The sample was cooled at -20 °C and then quenched with 1.5 M NH4HCO3.

[0090] For purification of the trityl-on oligomers, the quenched NH4HCO3 solution was 
loaded onto a C-18 containing cartridge that had been prewashed with acetonitrile followed 
by 50 mM TEAA. After washing the loaded cartridge with water, the RNA was detritylated 
with 0.5% TFA for 13 min. The cartridge was then washed again with water, salt exchanged 
with 1 M NaCl and washed with water again. The oligonucleotide was then eluted with 30% 
acetonitrile. 

[0091] Inactive hammerhead ribozymes or binding attenuated control (BAC) 
oligonucleotides were synthesized by substituting a U for G5 and a U for A14 (numbering 
from Hertel, K. J., et al., 1992, Nucleic Acids Res., 20, 3252). Similarly, one or more 
nucleotide substitutions can be introduced in other enzymatic nucleic acid molecules to 
inactivate the molecule and such molecules can serve as a negative control. 

[0092] The average stepwise coupling yields were >98% (Wincott et al., 1995 Nucleic 
Acids Res. 23, 2677-2684). Those of ordinary skill in the art will recognize that the scale of 
synthesis can be adapted to be larger or smaller than the example described above, 
including but not limited to 96 well format; all that is important is the ratio of chemicals used 
in the reaction. 
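
The effect of the stepwise coupling yield on the amount of full-length product can be estimated with a common approximation: if an oligomer of n nucleotides is built on a support that already carries the first nucleoside, n - 1 couplings are needed, so the full-length fraction is roughly the stepwise yield raised to the power n - 1. The Python sketch below applies that approximation and also shows a scale change that keeps the ratio of chemicals constant. The function names, the nucleoside-loaded support and any reagent names or amounts passed in are assumptions made for illustration, not values taken from the synthesis described above.

```python
from typing import Dict


def full_length_fraction(stepwise_yield: float, length_nt: int) -> float:
    """Approximate fraction of full-length product.

    Assumes (length_nt - 1) couplings, i.e. the first nucleoside is
    already attached to the support (an assumption of this sketch).
    """
    if not 0.0 < stepwise_yield <= 1.0:
        raise ValueError("stepwise_yield must be in (0, 1]")
    if length_nt < 1:
        raise ValueError("length_nt must be at least 1")
    return stepwise_yield ** (length_nt - 1)


def scale_reagents(amounts: Dict[str, float], factor: float) -> Dict[str, float]:
    """Scale every reagent amount by the same factor so ratios are unchanged."""
    if factor <= 0:
        raise ValueError("factor must be positive")
    return {name: amount * factor for name, amount in amounts.items()}
```

For example, `full_length_fraction(0.98, 36)` and `full_length_fraction(0.99, 36)` show how strongly a one-point change in stepwise yield affects a hypothetical 36-mer, and `scale_reagents` can be applied to any table of amounts when moving to a smaller format.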

[0093] Alternatively, the nucleic acid molecules of the present invention can be 
synthesized separately and joined together post-synthetically, for example by ligation (Moore 
et al., 1992, Science 256, 9923; Draper et al., International PCT publication No. WO 
93/23569; Shabarova et al., 1991, Nucleic Acids Research 19, 4247; Bellon et al., 1997, 
Nucleosides & Nucleotides, 16, 951; Bellon et al., 1997 Bioconjugate Chem. 8, 204). 

[0094] The nucleic acid molecules of the present invention are modified extensively to 
enhance stability by modification with nuclease resistant groups, for example, 2'-amino, 
2'-C-allyl, 2'-fluoro, 2'-O-methyl, 2'-H (for a review see Usman and Cedergren, 1992 TIBS 17, 34; 
Usman et al., 1994 Nucleic Acids Symp. Ser. 31, 163). Ribozymes are purified by gel 
electrophoresis using general methods or are purified by high pressure liquid 
chromatography (HPLC; see Wincott et al., supra, the totality of which is hereby 
incorporated herein by reference) and are re-suspended in water. 

[0095] The sequences of the ribozymes that are chemically synthesized, useful in this 
study, are shown in Tables III to VII. Those in the art will recognize that these sequences 
are representative only of many more such sequences where the enzymatic portion of the 
ribozyme (all but the binding arms) is altered to affect activity. The ribozyme sequences 
listed in Tables III to V and VII may be formed of ribonucleotides or other nucleotides or 
non-nucleotides. Such ribozymes with enzymatic activity are equivalent to the ribozymes 
described specifically in the Tables. 

Optimizing Activity of the nucleic acid molecule of the invention. 

[0096] Chemically synthesizing nucleic acid molecules with modifications 
(base, sugar and/or phosphate) that prevent their degradation by serum ribonucleases may 
increase their potency (see e.g., Eckstein et al., International Publication No. WO 92/07065; 
Perrault et al., 1990 Nature 344, 565; Pieken et al., 1991 Science 253, 314; Usman and 
Cedergren, 1992 Trends in Biochem. Sci. 17, 334; Usman et al., International Publication 
No. WO 93/15187; and Rossi et al., International Publication No. WO 91/03162; Sproat, US 
Patent No. 5,334,711; and Burgin et al., supra; all of these describe various chemical 
modifications that can be made to the base, phosphate and/or sugar moieties of the nucleic 
acid molecules herein). Modifications which enhance their efficacy in cells, and removal of 
bases from nucleic acid molecules to shorten oligonucleotide synthesis times and reduce 
chemical requirements are desired. (All these publications are hereby incorporated by 
reference herein). 

[0097] There are several examples in the art describing sugar, base and phosphate 
modifications that can be introduced into nucleic acid molecules with significant 
enhancement in their nuclease stability and efficacy. For example, oligonucleotides are 
modified to enhance stability and/or enhance biological activity by modification with nuclease 
resistant groups, for example, 2'-amino, 2'-C-allyl, 2'-fluoro, 2'-O-methyl, 2'-H, nucleotide base 
modifications (for a review see Usman and Cedergren, 1992 TIBS 17, 34; Usman et al., 
1994 Nucleic Acids Symp. Ser. 31, 163; Burgin et al., 1996 Biochemistry 35, 14090). 
Sugar modifications of nucleic acid molecules have been extensively described in the art (see 
Eckstein et al., International Publication PCT No. WO 92/07065; Perrault et al. Nature 
1990, 344, 565-568; Pieken et al. Science 1991, 253, 314-317; Usman and Cedergren, 
Trends in Biochem. Sci. 1992, 17, 334-339; Usman et al. International Publication PCT No. 
WO 93/15187; Sproat, US Patent No. 5,334,711 and Beigelman et al., 1995 J. Biol. 
Chem. 270, 25702; Beigelman et al., International PCT publication No. WO 97/26270; 
Beigelman et al., US Patent No. 5,716,824; Usman et al., US patent No. 5,627,053; Woolf 
et al., International PCT Publication No. WO 98/13526; Thompson et al., USSN 60/082,404 
which was filed on April 20, 1998; Karpeisky et al., 1998 Tetrahedron Lett. 39, 1131; all of 
the references are hereby incorporated in their totality by reference herein). Such 
publications describe general methods and strategies to determine the location of 
incorporation of sugar, base and/or phosphate modifications and the like into ribozymes 
without inhibiting catalysis, and are incorporated by reference herein. In view of such 
teachings, similar modifications can be used as described herein to modify the nucleic acid 
molecules of the instant invention. 

[0098] While chemical modification of oligonucleotide internucleotide linkages with 
phosphorothioate, phosphorodithioate, and/or 5'-methylphosphonate linkages improves 
stability, too many of these modifications may cause some toxicity. Therefore, when 
designing nucleic acid molecules, the amount of these internucleotide linkages should be 
minimized. The reduction in the concentration of these linkages should lower toxicity, 
resulting in increased efficacy and higher specificity of these molecules. 

[0099] Nucleic acid molecules having chemical modifications which maintain or enhance 
activity are provided. Such nucleic acid is also generally more resistant to nucleases than 
unmodified nucleic acid. Thus, in a cell and/or in vivo the activity may not be significantly 
lowered. Therapeutic nucleic acid molecules delivered exogenously must optimally be stable 
within cells until translation of the target RNA has been inhibited long enough to reduce the 
levels of the undesirable protein. This period of time varies from hours to days 
depending upon the disease state. Clearly, nucleic acid molecules must be resistant to 
nucleases in order to function as effective intracellular therapeutic agents. Improvements in 
the chemical synthesis of RNA and DNA (Wincott et al., 1995 Nucleic Acids Res. 23, 2677; 
Caruthers et al., 1992, Methods in Enzymology 211, 3-19; incorporated by reference herein) 
have expanded the ability to modify nucleic acid molecules by introducing nucleotide 
modifications to enhance their nuclease stability as described above. 

[00100] Use of the nucleic acid-based molecules of the invention will lead to better 
treatment of the disease progression by affording the possibility of combination therapies 
(e.g., multiple antisense or enzymatic nucleic acid molecules targeted to different genes, 
nucleic acid molecules coupled with known small molecule inhibitors, or intermittent 
treatment with combinations of molecules (including different motifs) and/or other chemical 
or biological molecules). The treatment of patients with nucleic acid molecules may also 
include combinations of different types of nucleic acid molecules. 

[00101] Therapeutic nucleic acid molecules (e.g., enzymatic nucleic acid molecules and 
antisense nucleic acid molecules) delivered exogenously must optimally be stable within cells 
until translation of the target RNA has been inhibited long enough to reduce the levels of the 
undesirable protein. This period of time varies from hours to days depending upon the 
disease state. Clearly, these nucleic acid molecules must be resistant to nucleases in order 
to function as effective intracellular therapeutic agents. Improvements in the chemical 
synthesis of nucleic acid molecules described in the instant invention and in the art have 
expanded the ability to modify nucleic acid molecules by introducing nucleotide modifications 
to enhance their nuclease stability as described above. 

[00102] By "enhanced enzymatic activity" is meant to include activity measured in cells 
and/or in vivo where the activity is a reflection of both catalytic activity and ribozyme 
stability. In this invention, the product of these properties is increased or not significantly 
(less than 10-fold) decreased in vivo compared to an all-RNA ribozyme. 
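
The criterion in paragraph [00102] can be expressed as a simple comparison. The Python sketch below is one possible reading, assuming that catalytic activity and stability are each available as positive numbers in consistent (arbitrary) units; the function names and the `max_fold_decrease` parameter are hypothetical and are not defined in this specification.

```python
def effective_activity(catalytic_activity: float, stability: float) -> float:
    """Product of catalytic activity and stability, in arbitrary units."""
    if catalytic_activity < 0 or stability < 0:
        raise ValueError("inputs must be non-negative")
    return catalytic_activity * stability


def is_enhanced_or_maintained(modified: float, all_rna: float,
                              max_fold_decrease: float = 10.0) -> bool:
    """True if the modified ribozyme's product is increased, or decreased
    by less than max_fold_decrease, relative to the all-RNA ribozyme."""
    if all_rna <= 0:
        raise ValueError("all_rna reference must be positive")
    return modified * max_fold_decrease > all_rna
```

As a usage example, a modified ribozyme with lower catalytic activity but much higher stability can still satisfy the check, because only the product of the two properties is compared with the all-RNA reference.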

[00103] In yet another preferred embodiment, nucleic acid catalysts having chemical 
modifications which maintain or enhance enzymatic activity are provided. Such nucleic acid is 
also generally more resistant to nucleases than unmodified nucleic acid. Thus, in a cell 
and/or in vivo the activity may not be significantly lowered. As exemplified herein, such 
ribozymes are useful in a cell and/or in vivo even if activity overall is reduced 10-fold (Burgin 
et al., 1996, Biochemistry, 35, 14090). Such ribozymes herein are said to "maintain" the 
enzymatic activity of an all-RNA ribozyme. 

[00104] In another aspect the nucleic acid molecules comprise a 5'- and/or a 3'-cap 
structure. 

[00105] By "cap structure" is meant chemical modifications, which have been incorporated 
at the terminus of the oligonucleotide (see for example Wincott et al., WO 97/26270, 
incorporated by reference herein). These terminal modifications protect the nucleic acid 
molecule from exonuclease degradation, and may help in delivery and/or localization within a 
cell. The cap may be present at the 5'-terminus (5'-cap) or at the 3'-terminus (3'-cap) or may 
be present on both termini. In non-limiting examples: the 5'-cap is selected from the group 
comprising inverted abasic residue (moiety), 4',5'-methylene nucleotide; 
1-(beta-D-erythrofuranosyl) nucleotide, 4'-thio nucleotide, carbocyclic nucleotide; 1,5-anhydrohexitol 
nucleotide; L-nucleotides; alpha-nucleotides; modified base nucleotide; phosphorodithioate 
linkage; threo-pentofuranosyl nucleotide; acyclic 3',4'-seco nucleotide; acyclic 3,4- 
dihydroxybutyl nucleotide; acyclic 3,5-dihydroxypentyl nucleotide, 3'-3'-inverted nucleotide 
moiety; 3'-3'-inverted abasic moiety; 3'-2'-inverted nucleotide moiety; 3'-2'-inverted abasic 
moiety; 1,4-butanediol phosphate; 3'-phosphoramidate; hexylphosphate; aminohexyl 
phosphate; 3'-phosphate; 3'-phosphorothioate; phosphorodithioate; or bridging or 
non-bridging methylphosphonate moiety (for more details see Beigelman et al., International PCT 
publication No. WO 97/26270, incorporated by reference herein). In yet another preferred 
embodiment the 3'-cap is selected from a group comprising, 4',5'-methylene nucleotide; 1- 
(beta-D-erythrofuranosyl) nucleotide; 4'-thio nucleotide, carbocyclic nucleotide; 5'-amino-alkyl 
phosphate; 1,3-diamino-2-propyl phosphate, 3-aminopropyl phosphate; 6-aminohexyl 
phosphate; 1,2-aminododecyl phosphate; hydroxypropyl phosphate; 1,5-anhydrohexitol 
nucleotide; L-nucleotide; alpha-nucleotide; modified base nucleotide; phosphorodithioate; 
threo-pentofuranosyl nucleotide; acyclic 3',4'-seco nucleotide; 3,4-dihydroxybutyl nucleotide; 
3,5-dihydroxypentyl nucleotide, 5'-5'-inverted nucleotide moiety; 5'-5'-inverted abasic moiety; 
5'-phosphoramidate; 5'-phosphorothioate; 1,4-butanediol phosphate; 5'-amino; bridging 
and/or non-bridging 5'-phosphoramidate, phosphorothioate and/or phosphorodithioate, 
bridging or non-bridging methylphosphonate and 5'-mercapto moieties (for more details see 
Beaucage and Iyer, 1993, Tetrahedron 49, 1925; incorporated by reference herein). By the 
term "non-nucleotide" is meant any group or compound which can be incorporated into a 
nucleic acid chain in the place of one or more nucleotide units, including either sugar and/or 
phosphate substitutions, and allows the remaining bases to exhibit their enzymatic activity. 
The group or compound is abasic in that it does not contain a commonly recognized 
nucleotide base, such as adenine, guanine, cytosine, uracil or thymine. 

[00106] An "alkyl" group refers to a saturated aliphatic hydrocarbon, including straight- 
chain, branched-chain, and cyclic alkyl groups. Preferably, the alkyl group has 1 to 12 
carbons. More preferably it is a lower alkyl of from 1 to 7 carbons, more preferably 1 to 4 
carbons. The alkyl group may be substituted or unsubstituted. When substituted the 
substituted group(s) is preferably, hydroxyl, cyano, alkoxy, =O, =S, NO2 or N(CH3)2, amino, 
or SH. The term also includes alkenyl groups which are unsaturated hydrocarbon groups 
containing at least one carbon-carbon double bond, including straight-chain, branched-chain, 
and cyclic groups. Preferably, the alkenyl group has 1 to 12 carbons. More preferably it is 
a lower alkenyl of from 1 to 7 carbons, more preferably 1 to 4 carbons. The alkenyl group 
may be substituted or unsubstituted. When substituted the substituted group(s) is 
preferably, hydroxyl, cyano, alkoxy, =O, =S, NO2, halogen, N(CH3)2, amino, or SH. The 
term "alkyl" also includes alkynyl groups which have an unsaturated hydrocarbon group 
containing at least one carbon-carbon triple bond, including straight-chain, branched-chain, 
and cyclic groups. Preferably, the alkynyl group has 1 to 12 carbons. More preferably it is 
a lower alkynyl of from 1 to 7 carbons, more preferably 1 to 4 carbons. The alkynyl group 
may be substituted or unsubstituted. When substituted the substituted group(s) is 
preferably, hydroxyl, cyano, alkoxy, =O, =S, NO2 or N(CH3)2, amino or SH. 

[00107] Such alkyl groups may also include aryl, alkylaryl, carbocyclic aryl, heterocyclic 
aryl, amide and ester groups. An "aryl" group refers to an aromatic group which has at least 
one ring having a conjugated π electron system and includes carbocyclic aryl, heterocyclic 
aryl and biaryl groups, all of which may be optionally substituted. The preferred 
substituent(s) of aryl groups are halogen, trihalomethyl, hydroxyl, SH, OH, cyano, alkoxy, 
alkyl, alkenyl, alkynyl, and amino groups. An "alkylaryl" group refers to an alkyl group (as 
described above) covalently joined to an aryl group (as described above). Carbocyclic aryl 
groups are groups wherein the ring atoms on the aromatic ring are all carbon atoms. The 
carbon atoms are optionally substituted. Heterocyclic aryl groups are groups having from 1 
to 3 heteroatoms as ring atoms in the aromatic ring and the remainder of the ring atoms are 
carbon atoms. Suitable heteroatoms include oxygen, sulfur, and nitrogen, and include 
furanyl, thienyl, pyridyl, pyrrolyl, N-lower alkyl pyrrolo, pyrimidyl, pyrazinyl, imidazolyl and the 
like, all optionally substituted. An "amide" refers to an -C(O)-NH-R, where R is either alkyl, 
aryl, alkylaryl or hydrogen. An "ester" refers to an -C(O)-OR', where R' is either alkyl, aryl, 
alkylaryl or hydrogen. 

[00108] By "nucleotide" as used herein is as recognized in the art to include natural bases 
(standard), and modified bases well known in the art. Such bases are generally located at 
the 1' position of a nucleotide sugar moiety. Nucleotides generally comprise a base, sugar 
and a phosphate group. The nucleotides can be unmodified or modified at the sugar, 
phosphate and/or base moiety, (also referred to interchangeably as nucleotide analogs, 
modified nucleotides, non-natural nucleotides, non-standard nucleotides and other; see for 
example, Usman and McSwiggen, supra; Eckstein et al., International PCT Publication No. 
WO 92/07065; Usman et al., International PCT Publication No. WO 93/15187; Uhlman & 
Peyman, supra; all are hereby incorporated by reference herein). There are several 
examples of modified nucleic acid bases known in the art, as recently summarized 
by Limbach et al., 1994, Nucleic Acids Res. 22, 2183. Some of the non-limiting examples 
of base modifications that can be introduced into nucleic acid molecules include, inosine, 
purine, pyridin-4-one, pyridin-2-one, phenyl, pseudouracil, 2,4,6-trimethoxy benzene, 
3-methyl uracil, dihydrouridine, naphthyl, aminophenyl, 5-alkylcytidines (e.g., 5-methylcytidine), 
5-alkyluridines (e.g., ribothymidine), 5-halouridine (e.g., 5-bromouridine) or 6-azapyrimidines 
or 6-alkylpyrimidines (e.g. 6-methyluridine), propyne, and others (Burgin et al., 1996, 
Biochemistry, 35, 14090; Uhlman & Peyman, supra). By "modified bases" in this aspect is 
meant nucleotide bases other than adenine, guanine, cytosine and uracil at 1' position or 
their equivalents; such bases may be used at any position, for example, within the catalytic 
core of an enzymatic nucleic acid molecule and/or in the substrate-binding regions of the 
nucleic acid molecule. 

[00109] By "abasic" is meant sugar moieties lacking a base or having other chemical 
groups in place of a base at the 1' position. 

[00110] By "ribonucleotide" is meant a nucleotide with one of the bases adenine, cytosine, 
guanine, or uracil joined to the 1' carbon of β-D-ribo-furanose. 

[00111] By "unmodified nucleoside" is meant one of the bases adenine, cytosine, guanine, 
or uracil joined to the 1' carbon of β-D-ribo-furanose. 

[00112] By "modified nucleoside" is meant any nucleotide base which contains a 
modification in the chemical structure of an unmodified nucleotide base, sugar and/or 
phosphate. 

[00113] In connection with 2'-modified nucleotides as described for the present invention, 
by "amino" is meant 2'-NH2 or 2'-O-NH2, which may be modified or unmodified. Such 
modified groups are described, for example, in Eckstein et al., U.S. Patent 5,672,695 and 
Matulic-Adamic et al., WO 98/28317, respectively, which are both incorporated by reference 
in their entireties. 

[00114] Various modifications to nucleic acid (e.g., antisense and ribozyme) structure can 
be made to enhance the utility of these molecules. Such modifications will enhance shelf-life, 
half-life in vitro, stability, and ease of introduction of such oligonucleotides to the target site, 
e.g., to enhance penetration of cellular membranes, and confer the ability to recognize and 
bind to targeted cells. 

[00115] Use of these molecules will lead to better treatment of the disease progression by 
affording the possibility of combination therapies (e.g., multiple ribozymes targeted to 
different genes, ribozymes coupled with known small molecule inhibitors, or intermittent 
treatment with combinations of ribozymes (including different ribozyme motifs) and/or other 
chemical or biological molecules). The treatment of patients with nucleic acid molecules 
may also include combinations of different types of nucleic acid molecules. Therapies may 
be devised which include a mixture of ribozymes (including different ribozyme motifs), 
antisense and/or 2-5A chimera molecules to one or more targets to alleviate symptoms of a 
disease. 

Administration of Nucleic Acid Molecules 

[00116] Methods for the delivery of nucleic acid molecules are described in Akhtar et al., 
1992, Trends Cell Bio., 2, 139; and Delivery Strategies for Antisense Oligonucleotide 
Therapeutics, ed. Akhtar, 1995 which are both incorporated herein by reference. Sullivan et al., 
PCT WO 94/02595, further describes the general methods for delivery of enzymatic RNA 
molecules. These protocols may be utilized for the delivery of virtually any nucleic acid molecule. 
Nucleic acid molecules may be administered to cells by a variety of methods known to those 
familiar to the art, including, but not restricted to, encapsulation in liposomes, by iontophoresis, 
or by incorporation into other vehicles, such as hydrogels, cyclodextrins, biodegradable 
nanocapsules, and bioadhesive microspheres. For some indications, nucleic acid molecules may 
be directly delivered ex vivo to cells or tissues with or without the aforementioned vehicles. 
Alternatively, the nucleic acid/vehicle combination is locally delivered by direct injection or by use 
of a catheter, infusion pump or stent. Other routes of delivery include, but are not limited to, 
intravascular, intramuscular, subcutaneous or joint injection, aerosol inhalation, oral (tablet or pill 
form), topical, systemic, ocular, intraperitoneal and/or intrathecal delivery. More detailed 
descriptions of nucleic acid delivery and administration are provided in Sullivan et al., supra and 
Draper et al., PCT WO 93/23569 which have been incorporated by reference herein. 

[00117] The molecules of the instant invention can be used as pharmaceutical agents. 
Pharmaceutical agents prevent, inhibit the occurrence, or treat (alleviate a symptom to some 
extent, preferably all of the symptoms) of a disease state in a patient. 

[00118] The negatively charged polynucleotides of the invention (e.g., RNA, DNA or 
protein) can be administered and introduced into a patient by any standard means, with or without 
stabilizers, buffers, and the like, to form a pharmaceutical composition. When it is desired 
to use a liposome delivery mechanism, standard protocols for formation of liposomes can 
be followed. The compositions of the present invention may also be formulated and used as 
tablets, capsules or elixirs for oral administration; suppositories for rectal administration; 
sterile solutions; suspensions for injectable administration; and the like. 

[00119] The present invention also includes pharmaceutically acceptable formulations of 
the compounds described. These formulations include salts of the above compounds, e.g., 
acid addition salts, for example, salts of hydrochloric, hydrobromic, acetic acid, and 
benzene sulfonic acid. 

[00120] A pharmacological composition or formulation refers to a composition or 
formulation in a form suitable for administration, e.g., systemic administration, into a cell or 
patient, preferably a human. Suitable forms, in part, depend upon the use or the route of 
entry, for example oral, transdermal, or by injection. Such forms should not prevent the 
composition or formulation from reaching a target cell (i.e., a cell to which the negatively charged 
polymer is desired to be delivered). For example, pharmacological compositions injected 
into the blood stream should be soluble. Other factors are known in the art, and include 
considerations such as toxicity and forms which prevent the composition or formulation from 
exerting its effect. 

[00121] By "systemic administration" is meant in vivo systemic absorption or accumulation 
of drugs in the blood stream followed by distribution throughout the entire body. 
Administration routes which lead to systemic absorption include, without limitations: 
intravenous, subcutaneous, intraperitoneal, inhalation, oral, intrapulmonary and 
intramuscular. Each of these administration routes exposes the desired negatively charged 
polymers, e.g., nucleic acids, to an accessible diseased tissue. The rate of entry of a drug 
into the circulation has been shown to be a function of molecular weight or size. The use of 
a liposome or other drug carrier comprising the compounds of the instant invention can 
potentially localize the drug, for example, in certain tissue types, such as the tissues of the 
reticular endothelial system (RES). A liposome formulation which can facilitate the 
association of drug with the surface of cells, such as lymphocytes and macrophages, is also 
useful. This approach may provide enhanced delivery of the drug to target cells by taking 
advantage of the specificity of macrophage and lymphocyte immune recognition of abnormal 
cells, such as the cancer cells. 

[00122] The invention also features the use of the composition comprising surface- 
modified liposomes containing poly (ethylene glycol) lipids (PEG-modified, or long-circulating 
liposomes or stealth liposomes). These formulations offer a method for increasing the 
accumulation of drugs in target tissues. This class of drug carriers resists opsonization and 
elimination by the mononuclear phagocytic system (MPS or RES), thereby enabling longer 
blood circulation times and enhanced tissue exposure for the encapsulated drug (Lasic et al., 
Chem. Rev. 1995, 95, 2601-2627; Ishiwata et al., Chem. Pharm. Bull. 1995, 43, 1005- 
1011). Such liposomes have been shown to accumulate selectively in tumors, presumably 
by extravasation and capture in the neovascularized target tissues (Lasic et al., Science 
1995, 267, 1275-1276; Oku et al., 1995, Biochim. Biophys. Acta, 1238, 86-90). The long- 
circulating liposomes enhance the pharmacokinetics and pharmacodynamics of DNA and 
RNA, particularly compared to conventional cationic liposomes which are known to 
accumulate in tissues of the MPS (Liu et al., J. Biol. Chem. 1995, 42, 24864-24870; Choi et 
al., International PCT Publication No. WO 96/10391; Ansell et al., International PCT 
Publication No. WO 96/10390; Holland et al., International PCT Publication No. WO 
96/10392; all of these are incorporated by reference herein). Long-circulating liposomes 
are also likely to protect drugs from nuclease degradation to a greater extent compared to 
cationic liposomes, based on their ability to avoid accumulation in metabolically aggressive 
MPS tissues such as the liver and spleen. All of these references are incorporated by 
reference herein. 

[00123] The present invention also includes compositions prepared for storage or 
administration which include a pharmaceutically effective amount of the desired compounds 
in a pharmaceutically acceptable carrier or diluent. Acceptable carriers or diluents for 
therapeutic use are well known in the pharmaceutical art, and are described, for example, in 
Remington's Pharmaceutical Sciences, Mack Publishing Co. (A.R. Gennaro edit. 1985) 
hereby incorporated by reference herein. For example, preservatives, stabilizers, dyes and 
flavoring agents may be provided. These include sodium benzoate, sorbic acid and esters 
of p-hydroxybenzoic acid. In addition, antioxidants and suspending agents may be used. 

[00124] A pharmaceutically effective dose is that dose required to prevent, inhibit the 
occurrence, or treat (alleviate a symptom to some extent, preferably all of the symptoms) of 
a disease state. The pharmaceutically effective dose depends on the type of disease, the 
composition used, the route of administration, the type of mammal being treated, the 
physical characteristics of the specific mammal under consideration, concurrent medication, 
and other factors which those skilled in the medical arts will recognize. Generally, an 
amount between 0.1 mg/kg and 100 mg/kg body weight/day of active ingredients is 
administered dependent upon potency of the negatively charged polymer. 
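
As a purely arithmetic illustration of the range stated in paragraph [00124], the Python sketch below converts the 0.1 mg/kg to 100 mg/kg body weight/day bounds into absolute daily amounts for a given body weight. The function and constant names are hypothetical, and the calculation ignores every other factor listed in the paragraph (type of disease, route of administration, potency and so on).

```python
from typing import Tuple

# Bounds taken from the range stated in the text (mg per kg body weight per day).
MIN_MG_PER_KG_PER_DAY = 0.1
MAX_MG_PER_KG_PER_DAY = 100.0


def daily_amount_range_mg(body_weight_kg: float) -> Tuple[float, float]:
    """Return (lower, upper) daily amounts in mg for the given body weight."""
    if body_weight_kg <= 0:
        raise ValueError("body_weight_kg must be positive")
    return (MIN_MG_PER_KG_PER_DAY * body_weight_kg,
            MAX_MG_PER_KG_PER_DAY * body_weight_kg)
```

For a hypothetical 70 kg body weight, the returned pair spans roughly 7 mg to 7000 mg per day, a thousand-fold range that the remaining factors in the paragraph would narrow.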

[00125] The nucleic acid molecules of the present invention may also be administered to a 
patient in combination with other therapeutic compounds to increase the overall therapeutic 
effect. The use of multiple compounds to treat an indication may increase the beneficial 
effects while reducing the presence of side effects. 

[00126] Alternatively, certain of the nucleic acid molecules of the instant invention (e.g., 
formula IV) can be expressed within cells from eukaryotic promoters (e.g., Izant and 
Weintraub, 1985 Science 229, 345; McGarry and Lindquist, 1986 Proc. Natl. Acad. Sci. 
USA 83, 399; Scanlon et al., 1991, Proc. Natl. Acad. Sci. USA. 88, 10591-5; Kashani- 
Sabet et al., 1992 Antisense Res. Dev., 2, 3-15; Dropulic et al., 1992 J. Virol, 66, 1432- 
41; Weerasinghe et al., 1991 J. Virol, 65, 5531-4; Ojwang et al., 1992 Proc. Natl. Acad. 
Sci. USA 89, 10802-6; Chen et al., 1992 Nucleic Acids Res., 20, 4581-9; Sarver et al., 
1990 Science 247, 1222-1225; Thompson et al., 1995 Nucleic Acids Res. 23, 2259; Good 
et al., 1997, Gene Therapy, 4, 45; all of the references are hereby incorporated in their 
totality by reference herein). Those skilled in the art realize that any nucleic acid can be 
expressed in eukaryotic cells from the appropriate DNA/RNA vector. The activity of such 
nucleic acids can be augmented by their release from the primary transcript by a ribozyme 
(Draper et al., PCT WO 93/23569, and Sullivan et al., PCT WO 94/02595; Ohkawa et al., 
1992 Nucleic Acids Symp. Ser., 27, 15-6; Taira et al., 1991, Nucleic Acids Res., 19, 
5125-30; Ventura et al., 1993 Nucleic Acids Res., 21, 3249-55; Chowrira et al., 1994 J. 
Biol. Chem. 269, 25856; all of the references are hereby incorporated in their totality by 
reference herein). 

[00127] In another aspect of the invention, RNA molecules of the present invention are 
preferably expressed from transcription units (see for example Couture et al., 1996, TIG., 
12, 510) inserted into DNA or RNA vectors. The recombinant vectors are preferably DNA 
plasmids or viral vectors. Ribozyme expressing viral vectors could be constructed based on, 
but not limited to, adeno-associated virus, retrovirus, adenovirus, or alphavirus. Preferably, 
the recombinant vectors capable of expressing the nucleic acid molecules are delivered as 
described above, and persist in target cells. Alternatively, viral vectors may be used that 
provide for transient expression of nucleic acid molecules. Such vectors might be 
repeatedly administered as necessary. Once expressed, the nucleic acid molecule binds to 
the target mRNA. Delivery of nucleic acid molecule expressing vectors could be systemic, 
such as by intravenous or intra-muscular administration, by administration to target cells ex- 
planted from the patient followed by reintroduction into the patient, or by any other means 
that would allow for introduction into the desired target cell (for a review see Couture et al., 
1996, TIG., 12, 510). 

[00128] In one aspect, the invention features an expression vector comprising a nucleic acid 
sequence encoding at least one of the nucleic acid molecules of the instant invention. 
The nucleic acid sequence encoding the nucleic acid molecule of the instant 
invention is operably linked in a manner which allows expression of that nucleic acid 
molecule. 

[00129] In another aspect, the expression vector comprises: a) a 
transcription initiation region (e.g., eukaryotic pol I, II or III initiation region); b) a transcription 
termination region (e.g., eukaryotic pol I, II or III termination region); c) a gene encoding at 
least one of the nucleic acid catalysts of the instant invention; and wherein said gene is 
operably linked to said initiation region and said termination region, in a manner which allows 
expression and/or delivery of said nucleic acid molecule. The vector may optionally include 
an open reading frame (ORF) for a protein operably linked on the 5' side or the 3'-side of the 
gene encoding the nucleic acid catalyst of the invention; and/or an intron (intervening 
sequences). 
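
The components of the expression vector listed in paragraph [00129] can be sketched as a small data structure. The Python example below is a minimal illustration, not a description of any actual construct: the class name, field names and `layout` method are hypothetical, and placing the optional ORF between the initiation and termination regions is an assumption of this sketch. The optional intron is recorded only as a flag because the text does not state its position.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ExpressionVector:
    """Hypothetical record of the elements a) to c) plus optional features."""
    initiation_region: str           # a) e.g. a pol I, II or III initiation region
    termination_region: str          # b) e.g. a pol I, II or III termination region
    gene: str                        # c) gene encoding the nucleic acid catalyst
    orf_side: Optional[str] = None   # "5'" or "3'" side of the gene, or None
    has_intron: bool = False         # position not specified in the text

    def __post_init__(self) -> None:
        if self.orf_side not in (None, "5'", "3'"):
            raise ValueError("orf_side must be None, \"5'\" or \"3'\"")

    def layout(self) -> List[str]:
        """Return the linked elements in 5' to 3' order (assumed ordering)."""
        elements = [self.initiation_region]
        if self.orf_side == "5'":
            elements.append("ORF")
        elements.append(self.gene)
        if self.orf_side == "3'":
            elements.append("ORF")
        elements.append(self.termination_region)
        return elements
```

For instance, constructing the record with `orf_side="3'"` and calling `layout()` lists the gene followed by the ORF, flanked by the initiation and termination regions to which the gene is operably linked.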

[00130] Transcription of the nucleic acid molecule sequences is driven from a promoter 
for eukaryotic RNA polymerase I (pol I), RNA polymerase II (pol II), or RNA polymerase III (pol 
III). Transcripts from pol II or pol III promoters will be expressed at high levels in all cells; the 
levels of a given pol II promoter in a given cell type will depend on the nature of the gene 
regulatory sequences (enhancers, silencers, etc.) present nearby. Prokaryotic RNA 
polymerase promoters are also used, providing that the prokaryotic RNA polymerase 
enzyme is expressed in the appropriate cells (EIroy-Stein and Moss, 1990 Proc. Natl. Acad. 
Sci. USA, 87, 6743-7; Gao and Huang 1993 Nucleic Acids Res., 21, 2867-72; Lieber et 
al., 1993 Methods EnzymoL, 217, 47-66; Zhou et al., 1990 Mol. Cell. Biol.. 10, 4529-37). 
Several investigators have demonstrated that nucleic acid molecules, such as ribozymes 
expressed from such promoters can function in mammalian cells (e.g. Kashani-Sabet et al., 

1992 Antisense Res. Dev., 2, 3-15; Ojwang et a!., 1992 Proc. Natl. Acad. Sci. USA, 89, 
10802-6; Chen et al., 1992 Nucleic Acids Res., 20, 4581-9; Yu et al., 1993 Proc. Natl. 
Acad. Sci. USA, 90, 63404; L'Huillier etai, 1992 EMBOJ. 11, 4411-8; Lisziewicz etal., 

1993 Proc. Natl. Acad. Sci. U. S. A., 90, 80004; Thompson et al., 1995 Nucleic Acids 
Res. 23, 2259; Sullenger & Cech, 1993, Science, 262, 1566). More specifically, 
transcription units such as the ones derived from genes encoding U6 small nuclear (snRNA), 
transfer RNA (tRNA) and adenovirus VA RNA are useful in generating high concentrations of 
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desired RNA molecules such as ribozymes in cells (Thompson et a/., supra; Couture and 
Stinchcomb, 1996, supra; Noonberg et a/„ 1994, Nucleic Acid Res., 22, 2830; Noonberg 
et a/., US Patent No. 5,624,803; Good et a/., 1997, Gene Ther. 4, 45; Beigelman et a/.. 
International PCT Publication No. WO 96/18736; all of these publications are incorporated 
by reference herein. The above ribozyme transcription units can be incorporated into a 
variety of vectors for introduction into mammalian cells, including but not restricted to, 
plasmid DNA vectors, viral DNA vectors (such as adenovirus or adeno-associated virus 
vectors), or viral RNA vectors (such as retroviral or alphavirus vectors) (for a review see 
Couture and Stinchcomb, 1996, supra). 

[00131] In yet another aspect, the invention features an expression vector comprising a
nucleic acid sequence encoding at least one of the nucleic acid molecules of the invention, in
a manner which allows expression of that nucleic acid molecule. The expression vector
comprises, in one embodiment: a) a transcription initiation region; b) a transcription
termination region; c) a gene encoding at least one said nucleic acid molecule; and wherein
said gene is operably linked to said initiation region and said termination region, in a manner
which allows expression and/or delivery of said nucleic acid molecule. In another preferred
embodiment, the expression vector comprises: a) a transcription initiation region; b) a
transcription termination region; c) an open reading frame; d) a gene encoding at least one
said nucleic acid molecule, wherein said gene is operably linked to the 3'-end of said open
reading frame; and wherein said gene is operably linked to said initiation region, said open
reading frame and said termination region, in a manner which allows expression and/or
delivery of said nucleic acid molecule. In yet another embodiment, the expression vector
comprises: a) a transcription initiation region; b) a transcription termination region; c) an
intron; d) a gene encoding at least one said nucleic acid molecule; and wherein said gene is
operably linked to said initiation region, said intron and said termination region, in a manner
which allows expression and/or delivery of said nucleic acid molecule. In another
embodiment, the expression vector comprises: a) a transcription initiation region; b) a
transcription termination region; c) an intron; d) an open reading frame; e) a gene encoding
at least one said nucleic acid molecule, wherein said gene is operably linked to the 3'-end of
said open reading frame; and wherein said gene is operably linked to said initiation region,
said intron, said open reading frame and said termination region, in a manner which allows
expression and/or delivery of said nucleic acid molecule.

Examples. 

[00132] The following are non-limiting examples showing the selection, isolation, synthesis 
and activity of nucleic acids of the instant invention. 

[00133] The following examples demonstrate the selection and design of antisense,
hammerhead, DNAzyme, NCH, or G-Cleaver ribozyme molecules and binding/cleavage sites
within TERT RNA.

Example 1: Identification of Potential Target Sites in Human TERT RNA 
[00134] The sequence of human TERT was screened for accessible sites using a 
computer folding algorithm. Regions of the RNA that did not form secondary folding 
structures and contained potential ribozyme and/or antisense binding/cleavage sites were 
identified. The sequences of these cleavage sites are shown in Tables III-VII.
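
By way of non-limiting illustration, the sequence-motif part of such a screen can be sketched in
Python. The sketch below only lists candidate hammerhead sites using the UH requirement given in
Table I (a U followed by A, C or U); it does not model RNA folding or accessibility, which the
screen described above also relies on. The function name find_uh_sites and the short test
sequences are hypothetical and are not taken from the TERT sequence.

```python
import re


def find_uh_sites(rna):
    """Return (1-based position of U, dinucleotide) for each UH motif, H = A, C or U.

    Illustrative only: secondary structure and accessibility are NOT evaluated here.
    """
    seq = rna.upper().replace("T", "U")  # accept DNA-style input as well
    # The lookahead consumes only the U, so overlapping motifs (e.g. UUA) are all reported.
    return [
        (m.start() + 1, seq[m.start():m.start() + 2])
        for m in re.finditer(r"U(?=[ACU])", seq)
    ]


if __name__ == "__main__":
    # Hypothetical input, used only to show the call.
    for position, motif in find_uh_sites("GUCAUG"):
        print(position, motif)
```

Candidate positions returned by such a scan would still need to be ranked by predicted folding
before any site is selected.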

Example 2: Selection of Enzymatic Nucleic Acid Cleavage Sites in Human TERT RNA 

[00135] To test whether the sites predicted by the computer-based RNA folding algorithm
corresponded to accessible sites in TERT RNA, 10 hammerhead ribozyme and three
G-Cleaver ribozyme sites were selected for further analysis (Table VI). Ribozyme target sites
were chosen by analyzing sequences of human TERT (Nakamura et al., 1997 Science 277,
955-959; Genbank sequence accession number: NM_003219) and prioritizing the sites on
the basis of folding. Ribozymes were designed that could bind each target and were
individually analyzed by computer folding (Christoffersen et al., 1994 J. Mol. Struc.
Theochem, 311, 273; Jaeger et al., 1989, Proc. Natl. Acad. Sci. USA, 86, 7706) to assess
whether the ribozyme sequences fold into the appropriate secondary structure. Those
ribozymes with unfavorable intramolecular interactions between the binding arms and the
catalytic core were eliminated from consideration. As noted below, varying binding arm
lengths can be chosen to optimize activity. Generally, at least 5 bases on each arm are able
to bind to, or otherwise interact with, the target RNA.
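
By way of non-limiting illustration, the G-Cleaver entries listed in Table V appear to follow a
regular pattern: the 5' binding arm of the ribozyme is the reverse complement of the substrate
sequence 3' of the unpaired G, the 3' binding arm is the reverse complement of the substrate
sequence 5' of that G, and the same 21-nucleotide core sits between the arms. The Python sketch
below reproduces that pattern. The pattern is inferred from the listed sequences and is not
stated as a design rule in the text; the names CORE, reverse_complement and design_g_cleaver are
hypothetical.

```python
# Core shared by every G-Cleaver ribozyme sequence listed in Table V.
CORE = "UGAUGGCAUGCACUAUGCGCG"

_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(rna):
    """Watson-Crick reverse complement of an RNA string (A, C, G, U only)."""
    return rna.translate(_COMPLEMENT)[::-1]


def design_g_cleaver(substrate):
    """Build a ribozyme string from a substrate written as "5'-arm G 3'-arm".

    The substrate format (three space-separated fields) mirrors the Table V layout.
    """
    five_prime, unpaired, three_prime = substrate.split()
    if unpaired != "G":
        raise ValueError("expected a single unpaired G between the two arms")
    return " ".join(
        [reverse_complement(three_prime), CORE, reverse_complement(five_prime)]
    )


if __name__ == "__main__":
    # Substrate at nt. position 16 in Table V.
    print(design_g_cleaver("GCUGCGUCCU G CUGCG"))
```

Varying the number of bases supplied on each side of the G changes the binding arm lengths, which
is one way to explore the arm-length optimization mentioned above.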

Example 3: Chemical Synthesis and Purification of Ribozymes for Efficient Cleavage of TERT
RNA

[00136] Ribozymes were designed to anneal to various sites in the RNA message. The 
binding arms are complementary to the target site sequences described above. The 
ribozymes were chemically synthesized. The method of synthesis used followed the 
procedure for normal RNA synthesis as described above and in Usman et al., (1987 J. Am. 
Chem. Soc., 109, 7845), Scaringe et al., (1990 Nucleic Acids Res., 18, 5433) and Wincott
et al., supra, and made use of common nucleic acid protecting and coupling groups, such as 
dimethoxytrityl at the 5'-end, and phosphoramidites at the 3'-end. The average stepwise 
coupling yields were >98%. 

[00137] Ribozymes were also synthesized from DNA templates using bacteriophage T7 
RNA polymerase (Milligan and Uhlenbeck, 1989, Methods Enzymol. 180, 51). Ribozymes 
were purified by gel electrophoresis using general methods or were purified by high 
pressure liquid chromatography (HPLC; See Wincott et al., supra; the totality of which is 
hereby incorporated herein by reference) and were resuspended in water. The sequences of 
the chemically synthesized ribozymes used in this study are shown below in Tables III-VII.

Example 4: Ribozyme Cleavage of TERT RNA Target in vitro

[00138] Ribozymes targeted to the human TERT RNA are designed and synthesized as
described above. These ribozymes can be tested for cleavage activity in vitro, for example
using the following procedure. The target sequences and the nucleotide location within the
TERT RNA are given in Tables III-VII.

[00139] Cleavage Reactions: Full-length or partially full-length, internally-labeled target RNA
for ribozyme cleavage assay is prepared by in vitro transcription in the presence of [α-32P]
CTP, passed over a G-50 Sephadex column by spin chromatography and used as substrate
RNA without further purification. Alternatively, substrates are 5'-32P-end labeled using T4
polynucleotide kinase enzyme. Assays are performed by pre-warming 15 µl of a 2X
concentration of purified ribozyme in ribozyme cleavage buffer (50 mM Tris-HCl, pH 7.5 at
37°C, 10 mM MgCl2), and the cleavage reaction is initiated by adding the 2X ribozyme mix
to an equal volume (15 µl) of substrate RNA (maximum of 1-5 nM; 5 x 10^ to 1 x 10^ cpm)
that is also pre-warmed in cleavage buffer. As an initial screen, assays are carried out for
1 hour at 37°C using a final concentration of either 40 nM or 1 mM ribozyme, i.e., ribozyme
excess. The reaction is quenched by the addition of an equal volume (30 µl) of 95%
formamide, 20 mM EDTA, 0.05% bromophenol blue and 0.05% xylene cyanol, after which the
sample is heated to 95°C for 2 minutes, quick chilled and loaded onto a denaturing
polyacrylamide gel. Substrate RNA and the specific RNA cleavage products generated by
ribozyme cleavage are visualized on an autoradiograph of the gel. The percentage of
cleavage is determined by PhosphorImager® quantitation of bands representing the intact
substrate and the cleavage products.
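
By way of non-limiting illustration, the arithmetic behind this assay can be sketched in Python.
The sketch assumes that the percentage of cleavage is taken as the signal in the product bands
divided by the total signal (intact substrate plus products); the text does not spell out the
formula, so this is a common convention rather than the stated method. The function names and the
band intensities in the example are hypothetical.

```python
def final_concentration(stock_conc, stock_volume_ul, total_volume_ul):
    """Concentration after diluting stock_volume_ul of stock to total_volume_ul."""
    if total_volume_ul <= 0:
        raise ValueError("total volume must be positive")
    return stock_conc * stock_volume_ul / total_volume_ul


def percent_cleavage(intact_signal, product_signals):
    """Percentage of total band signal found in the cleavage products.

    Assumed formula: 100 * products / (intact + products).
    """
    products = sum(product_signals)
    total = intact_signal + products
    if total <= 0:
        raise ValueError("no signal to quantitate")
    return 100.0 * products / total


if __name__ == "__main__":
    # 15 µl of a 2X (80 nM) ribozyme mix added to 15 µl of substrate gives a 30 µl reaction.
    print(final_concentration(80.0, 15.0, 30.0), "nM ribozyme in the reaction")
    # Hypothetical band intensities in arbitrary units.
    print(percent_cleavage(750.0, [150.0, 100.0]), "% cleavage")
```

The same dilution helper applies to the quench step, where an equal volume (30 µl) of stop
solution halves every concentration in the reaction.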

Cell Culture Models 

[00140] Various methods have been developed to assay telomerase activity in vitro. The
most widely used method to characterize telomerase activity is the telomeric repeat
amplification protocol (TRAP). TRAP utilizes RT-PCR of cellular extracts to measure
telomerase activity by making the amount of PCR target dependent upon the biochemical
activity of the enzyme (Kim, N. W., 1997, Nucleic Acids Research, 25, 2595-2597).

[00141] Human cell culture studies have been established to assay inhibition of telomerase
activity in human carcinomas responding to various therapeutics. A human breast cancer
model for studying telomerase inhibitors is described (Raymond, E., 1999, Br. J. Cancer,
80, 1332-1341). Human studies of telomerase expression as related to various other
cancers are described, including cervical cancer (Nakano, K., 1998, Am. J. Pathol., 153,
857-864), endometrial cancer (Kyo, S., 1999, Int. J. Cancer, 80, 60-63), meningeal
carcinoma (Kleinschmidt-DeMasters, B. K., 1998, J. Neurol. Sci., 161, 124-134), lung
carcinoma (Yashima, K., 1997, Cancer Research, 57, 2372-2377), testicular cancer in
response to cisplatin (Burger, A. M., 1997, Eur. J. Cancer, 33, 638-644), and ovarian
carcinoma (Counter, C. M., 1994, Proc. Natl. Acad. Sci., 91, 2900-2904).

Animal Models 

[00142] A variety of animal models have been designed to assay telomerase activity in
vivo. Inhibition of telomerase activity has been analyzed in rats via cell proliferation studies
with N-methyl-N-nitrosourea (MNU)-induced mammary carcinomas in response to treatment
with 4-(hydroxyphenyl)retinamide (4-HPR), a known inhibitor of mammary carcinogenesis in
animal models and premenopausal women (Bednarek, A., 1999, Carcinogenesis, 20,
879-883). The method of Bednarek et al. uses MNU-induced mammary
carcinomas in rats to analyze the effect of telomerase inhibitors in vivo. MNU-induced
tumors express high telomerase activity. Female virgin Sprague-Dawley rats are injected
twice with MNU (50 mg/kg body weight), at 43 and 50 days of age. Mammary tumors
are allowed to grow to 4-8 mm before commencing treatment with an agent, such as
4-(hydroxyphenyl)retinamide (used by Bednarek et al.) or a nucleic acid of the invention being
tested as a modulator of telomerase activity. Following treatment with an agent for 0 to 6
weeks, telomerase activity is assayed using the TRAP method on CHAPS-extracted tumor-cell
protein samples. A decrease of 10% or more in telomerase activity relative to the level in
tumors of untreated animals indicates an agent is a telomerase inhibitor. Additional studies
have focused on the up-regulation of telomerase in transformed cell lines from animal and
human model systems (Zhang, P. B., 1998, Leuk. Res., 22, 509-516; Chadeneau, C.,
1995, Oncogene, 11, 893-898; Greenberg, R., 1999, Oncogene, 18, 1219-1226).
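
By way of non-limiting illustration, the 10% criterion described above for the MNU model can be
written as a small Python check. The sketch assumes that telomerase activity in treated and
untreated tumors has already been reduced to single comparable numbers (for example, mean TRAP
signal per group); how those numbers are obtained is outside the sketch. The function names and
the default threshold argument are hypothetical.

```python
def percent_decrease(treated_activity, untreated_activity):
    """Percent decrease in activity relative to the untreated level."""
    if untreated_activity <= 0:
        raise ValueError("untreated activity must be positive")
    return 100.0 * (untreated_activity - treated_activity) / untreated_activity


def is_telomerase_inhibitor(treated_activity, untreated_activity, threshold_percent=10.0):
    """True when activity falls by at least threshold_percent versus untreated tumors."""
    return percent_decrease(treated_activity, untreated_activity) >= threshold_percent


if __name__ == "__main__":
    # Hypothetical activity values in arbitrary units.
    print(is_telomerase_inhibitor(treated_activity=90, untreated_activity=100))
    print(is_telomerase_inhibitor(treated_activity=95, untreated_activity=100))
```

In practice, replicate animals and assay variability would also need to be taken into account
before a single comparison of this kind is relied upon.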

Indications 

[00143] Particular degenerative and disease states that can be associated with 
telomerase expression modulation include but are not limited to: 

• Cancer: Almost all human tumors have detectable telomerase activity (Shay, J. W., 
1997, Eur. J. Cancer, 33, 787-791). Treatment with telomerase inhibitors may provide 
effective cancer therapy with minimal side effects in normal somatic cells that lack 
telomerase activity. The therapeutic potential exists for the treatment of a wide variety of 
cancer types. 

• Restenosis: Telomerase inhibition in vascular smooth muscle cells may inhibit restenosis
by limiting proliferation of these cells.

• Infectious disease: Telomerase inhibition in infectious cell types that express telomerase 
activity may provide selective antibiotic activity. Such treatment may prove especially 
effective in protozoan-based infection such as Giardia and Leishmaniasis. 

• Transplant rejection: Telomerase inhibition in endothelial cell types may demonstrate
selective immunosuppressant activity. Activation of telomerase in transplant cells could
benefit grafting success through increased proliferative potential.

• Autoimmune disease: Telomerase modulation in various immune cells may prove 
beneficial in treating diseases such as multiple sclerosis, lupus, and AIDS. 

• Age related disease: Activation of telomerase expression in cells at or nearing 
senescence as a result of advanced age or premature aging could benefit conditions 
such as macular degeneration, skin ulceration, and rheumatoid arthritis.

[00144] The present body of knowledge in telomerase research indicates the need for
methods to assay telomerase activity and for compounds that can regulate telomerase
expression for research, diagnostic, and therapeutic use.

[00145] Gemcitabine and cyclophosphamide are non-limiting examples of
chemotherapeutic agents that can be combined with or used in conjunction with the nucleic
acid molecules (e.g., ribozymes and antisense molecules) of the instant invention. Those
skilled in the art will recognize that other drugs, such as anti-cancer compounds and
therapies, can similarly be readily combined with the nucleic acid molecules of the instant
invention (e.g., ribozymes and antisense molecules) and are hence within the scope of the
instant invention. Such compounds and therapies are well known in the art (see for example
Cancer: Principles and Practice of Oncology, Volumes 1 and 2, eds Devita, V.T., Hellman,
S., and Rosenberg, S.A., J.B. Lippincott Company, Philadelphia, USA; incorporated herein by
reference) and include, without limitation, antifolates; fluoropyrimidines; cytarabine; purine
analogs; adenosine analogs; amsacrine; topoisomerase I inhibitors; anthrapyrazoles;
retinoids; antibiotics such as bleomycin, anthracyclines, mitomycin C, dactinomycin, and
mithramycin; hexamethylmelamine; dacarbazine; L-asparaginase; platinum analogs; alkylating
agents such as nitrogen mustard, melphalan, chlorambucil, busulfan, ifosfamide,
4-hydroperoxycyclophosphamide, nitrosoureas, thiotepa; plant derived compounds such as
vinca alkaloids, epipodophyllotoxins, taxol; tamoxifen; radiation therapy; surgery; nutritional
supplements; gene therapy; radiotherapy such as 3D-CRT; immunotoxin therapy such as
ricin, monoclonal antibodies such as Herceptin; and the like. For combination therapy, the
nucleic acids of the invention are prepared in one of two ways. First, the agents are physically
combined in a preparation of nucleic acid and chemotherapeutic agent, such as a mixture of
a nucleic acid of the invention encapsulated in liposomes and ifosfamide in a solution for
intravenous administration, wherein both agents are present in a therapeutically effective
concentration (e.g., ifosfamide in solution to deliver 1000-1250 mg/m2/day and
liposome-associated nucleic acid of the invention in the same solution to deliver 0.1-100 mg/kg/day).
Alternatively, the agents are administered separately but simultaneously in their respective
effective doses (e.g., 1000-1250 mg/m2/day ifosfamide and 0.1 to 100 mg/kg/day nucleic
acid of the invention).

Diagnostic uses 

[00146] The nucleic acid molecules of this invention (e.g., ribozymes) may be used as 
diagnostic tools to examine genetic drift and mutations within diseased cells or to detect the 
presence of TERT RNA in a cell. The close relationship between ribozyme activity and the 
structure of the target RNA allows the detection of mutations in any region of the molecule 
which alters the base-pairing and three-dimensional structure of the target RNA. By using 
multiple ribozymes described in this invention, one may map nucleotide changes which are 
important to RNA structure and function in vitro, as well as in cells and tissues. Cleavage of
target RNAs with ribozymes may be used to inhibit gene expression and define the role 
(essentially) of specified gene products in the progression of disease. In this manner, other 
genetic targets may be defined as important mediators of the disease. These experiments 
will lead to better treatment of the disease progression by affording the possibility of 
combinational therapies (e.g., multiple ribozymes targeted to different genes, ribozymes 
coupled with known small molecule inhibitors, or intermittent treatment with combinations of 
ribozymes and/or other chemical or biological molecules). Other in vitro uses of ribozymes 
of this invention are well known in the art, and include detection of the presence of mRNAs 
associated with a TERT-related condition. Such RNA is detected by determining the presence
of a cleavage product after treatment with a ribozyme using standard methodology. 

[00147] In a specific example, ribozymes which can cleave only wild-type or mutant forms
of the target RNA are used for the assay. The first ribozyme is used to identify wild-type
RNA present in the sample and the second ribozyme is used to identify mutant RNA in
the sample. As reaction controls, synthetic substrates of both wild-type and mutant RNA will
be cleaved by both ribozymes to demonstrate the relative ribozyme efficiencies in the
reactions and the absence of cleavage of the "non-targeted" RNA species. The cleavage
products from the synthetic substrates will also serve to generate size markers for the
analysis of wild-type and mutant RNAs in the sample population. Thus each analysis will
require two ribozymes, two substrates and one unknown sample, which will be combined into
six reactions. The presence of cleavage products will be determined using an RNase
protection assay so that full-length and cleavage fragments of each RNA can be analyzed in
one lane of a polyacrylamide gel. For example, the cleavage reactions are performed in
ribozyme cleavage buffer with a final reaction volume of 30 µl per reaction as follows: 1)
ribozyme specific for (i.e., that specifically cleaves) wild-type RNA (wt ribozyme; 40 nM final
reaction concentration) is incubated with wild-type substrate (1-5 nM final reaction
concentration) at 37°C for one hour; 2) wt ribozyme is incubated with mutant substrate
(same conditions); 3) wt ribozyme (40 nM final concentration) is incubated with 50 µg of total
RNA from the individual being tested, at 37°C for one hour; 4) same as (1), only with 40 nM
final concentration of ribozyme specific for mutant RNA; 5) same as (2), only with ribozyme
specific for mutant RNA; and 6) same as (3), only with ribozyme specific for mutant RNA.
Cleavage products are precipitated with ethanol and resuspended in 20 µl of hybridization
buffer for RNase protection with 5 x 10^ to 1 x 10^ cpm of 32P-labeled RNA probe.
Hybridization buffer consists of the following (per reaction): 24 µl formamide, 2 µl 0.6 M
PIPES, 2.4 µl 5 M NaCl, 0.3 µl 0.1 M EDTA, and DEPC-treated water to 30 µl. Samples are
heated at 95°C for 10 minutes, then incubated 4 hours at 55°C (hybridization temperatures
may be estimated by one of skill in the art and optimized empirically for a given probe:target
combination without undue experimentation). Following hybridization, hybridized sequences
are digested with ribonucleases by the addition of 350 µl of RNase digestion buffer (300 mM
NaOAc, 10 mM Tris, 5 mM EDTA) followed by addition of 1 µl of 4 mg/ml RNase A and 0.4
µl of 10 U/µl RNase T1. Digestion is carried out for 45 minutes to 1 hour at 30°C, followed
by the addition of 10 µl of 20% SDS and 2.5 µl of 10 mg/ml Proteinase K. Samples are
incubated at 37°C for 15-20 minutes followed by phenol/chloroform/isoamyl alcohol
(25:24:1) extraction and precipitation with ethanol. Samples are resuspended in formamide
loading buffer, heat denatured and electrophoresed on a denaturing polyacrylamide gel.
Protected cleavage products are visualized by autoradiography and quantitated by
PhosphorImager analysis. It is not absolutely required to quantify the results to gain insight
into the expression of mutant RNAs and putative risk of the desired phenotypic changes in
target cells. The expression of mRNA whose protein product is implicated in the
development of the phenotype (i.e., TERT) is adequate to establish risk. If probes of
comparable specific activity are used for both transcripts, then a qualitative comparison of
RNA levels will be adequate and will decrease the cost of the initial diagnosis. Higher mutant
form to wild-type ratios will be correlated with higher risk whether RNA levels are compared
qualitatively or quantitatively.
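
By way of non-limiting illustration, two pieces of bookkeeping in the above example can be
sketched in Python: the six-reaction layout (two ribozymes against two synthetic substrates and
one sample) and the final concentrations in the 30 µl hybridization buffer. The sketch reads the
recipe volumes as microliters and the stock concentrations as molar, which is an interpretation
of the recipe; all names in the code are hypothetical.

```python
from itertools import product

RIBOZYMES = ("wt ribozyme", "mutant ribozyme")
TARGETS = ("wild-type substrate", "mutant substrate", "sample total RNA")


def reaction_matrix():
    """Number the ribozyme/target pairs 1-6 in the order the reactions are listed."""
    return [
        (number, ribozyme, target)
        for number, (ribozyme, target) in enumerate(product(RIBOZYMES, TARGETS), start=1)
    ]


# Recipe per reaction: component -> (volume in µl, stock concentration in mol/l).
# Units are an assumed reading of the recipe in the text.
HYBRIDIZATION_STOCKS = {
    "PIPES": (2.0, 0.6),
    "NaCl": (2.4, 5.0),
    "EDTA": (0.3, 0.1),
}
FORMAMIDE_UL = 24.0
FINAL_VOLUME_UL = 30.0


def final_concentrations_mM():
    """Final concentration of each salt/buffer component in millimolar."""
    return {
        name: 1000.0 * volume * stock / FINAL_VOLUME_UL
        for name, (volume, stock) in HYBRIDIZATION_STOCKS.items()
    }


def formamide_percent():
    """Formamide content of the buffer as percent by volume."""
    return 100.0 * FORMAMIDE_UL / FINAL_VOLUME_UL


if __name__ == "__main__":
    for row in reaction_matrix():
        print(*row)
    print(final_concentrations_mM(), formamide_percent())
```

Under this reading the buffer works out to 80% formamide, 40 mM PIPES, 400 mM NaCl and 1 mM EDTA.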

Additional Uses 

[00148] Sequence-specific enzymatic nucleic acid molecules of the instant invention
might have many of the same applications for the study of RNA that DNA
restriction endonucleases have for the study of DNA (Nathans et al., 1975 Ann. Rev.
Biochem. 44:273). For example, the pattern of restriction fragments could be used to
establish sequence relationships between two related RNAs, and large RNAs could be 
specifically cleaved to fragments of a size more useful for study. The ability to engineer 
sequence specificity of the enzymatic nucleic acid molecule is ideal for cleavage of RNAs of 
unknown sequence. Applicant describes the use of nucleic acid molecules to down-regulate 
gene expression of target genes in bacterial, microbial, fungal, viral, and eukaryotic systems 
including plant, or mammalian cells. 

[00149] All patents and publications mentioned in the specification are indicative of the 
levels of skill of those skilled in the art to which the invention pertains. All references cited in 
this disclosure are incorporated by reference to the same extent as if each reference had 
been incorporated by reference in its entirety individually.

[00150] One skilled in the art would readily appreciate that the present invention is well
adapted to carry out the objects and obtain the ends and advantages mentioned, as well as
those inherent therein. The methods and compositions described herein as presently
representative of preferred embodiments are exemplary and are not intended as limitations
on the scope of the invention. Changes therein and other uses will occur to those skilled in
the art, which are encompassed within the spirit of the invention and are defined by the
scope of the claims.

[00151] It will be readily apparent to one skilled in the art that varying substitutions and 
modifications may be made to the invention disclosed herein without departing from the 
scope and spirit of the invention. Thus, such additional embodiments are within the scope of 
the present invention and the following claims. 

[00152] The invention illustratively described herein suitably may be practiced in the 
absence of any element or elements, limitation or limitations which are not specifically
disclosed herein. Thus, for example, in each instance herein any of the terms "comprising", 
"consisting essentially of" and "consisting of" may be replaced with either of the other two 
terms. The terms and expressions which have been employed are used as terms of 
description and not of limitation, and there is no intention in the use of such terms and
expressions of excluding any equivalents of the features shown and described or portions 
thereof, but it is recognized that various modifications are possible within the scope of the 
invention claimed. Thus, it should be understood that although the present invention has 
been specifically disclosed by preferred embodiments, optional features, modification and 
variation of the concepts herein disclosed may be resorted to by those skilled in the art, and 
that such modifications and variations are considered to be within the scope of this invention 
as defined by the description and the appended claims. 

[00153] In addition, where features or aspects of the invention are described in terms of 
Markush groups or other grouping of alternatives, those skilled in the art will recognize that
the invention is also thereby described in terms of any individual member or subgroup of
members of the Markush group or other group.

[00154] Other embodiments are within the following claims.

TABLE I



Characteristics of naturally occurring ribozymes

Group I Introns

• Size: ~150 to >1000 nucleotides.
• Requires a U in the target sequence immediately 5' of the cleavage site.
• Binds 4-6 nucleotides at the 5'-side of the cleavage site.
• Reaction mechanism: attack by the 3'-OH of guanosine to generate cleavage products with 3'-OH and
5'-guanosine.
• Additional protein cofactors required in some cases to help folding and maintenance of the active structure.
• Over 300 known members of this class. Found as an intervening sequence in Tetrahymena thermophila
rRNA, fungal mitochondria, chloroplasts, phage T4, blue-green algae, and others.
• Major structural features largely established through phylogenetic comparisons, mutagenesis, and
biochemical studies.
• Complete kinetic framework established for one ribozyme.
• Studies of ribozyme folding and substrate docking underway.
• Chemical modification investigation of important residues well established.
• The small (4-6 nt) binding site may make this ribozyme too non-specific for targeted RNA cleavage; however,
the Tetrahymena group I intron has been used to repair a "defective" β-galactosidase message by the
ligation of new β-galactosidase sequences onto the defective message.

RNAse P RNA (M1 RNA)

• Size: ~290 to 400 nucleotides.
• RNA portion of a ubiquitous ribonucleoprotein enzyme.
• Cleaves tRNA precursors to form mature tRNA.
• Reaction mechanism: possible attack by M2+-OH to generate cleavage products with 3'-OH and
5'-phosphate.
• RNAse P is found throughout the prokaryotes and eukaryotes. The RNA subunit has been sequenced from
bacteria, yeast, rodents, and primates.
• Recruitment of endogenous RNAse P for therapeutic applications is possible through hybridization of an
External Guide Sequence (EGS) to the target RNA.
• Important phosphate and 2' OH contacts recently identified.

Group II Introns

• Size: >1000 nucleotides.
• Trans cleavage of target RNAs recently demonstrated.
• Sequence requirements not fully determined.
• Reaction mechanism: 2'-OH of an internal adenosine generates cleavage products with 3'-OH and a "lariat"
RNA containing a 3'-5' and a 2'-5' branch point.
• Only natural ribozyme with demonstrated participation in DNA cleavage in addition to RNA cleavage
and ligation.
• Major structural features largely established through phylogenetic comparisons.
• Important 2' OH contacts beginning to be identified.
• Kinetic framework under development.

Neurospora VS RNA

• Size: ~144 nucleotides.
• Trans cleavage of hairpin target RNAs recently demonstrated.
• Sequence requirements not fully determined.
• Reaction mechanism: attack by 2'-OH 5' to the scissile bond to generate cleavage products with 2',3'-cyclic
phosphate and 5'-OH ends.
• Binding sites and structural requirements not fully determined.
• Only 1 known member of this class. Found in Neurospora VS RNA.

Hammerhead Ribozyme

(see text for references)

• Size: ~13 to 40 nucleotides.
• Requires the target sequence UH immediately 5' of the cleavage site.
• Binds a variable number of nucleotides on both sides of the cleavage site.
• Reaction mechanism: attack by 2'-OH 5' to the scissile bond to generate cleavage products with 2',3'-cyclic
phosphate and 5'-OH ends.
• 14 known members of this class. Found in a number of plant pathogens (virusoids) that use RNA as the
infectious agent.
• Essential structural features largely defined, including 2 crystal structures.
• Minimal ligation activity demonstrated (for engineering through in vitro selection).
• Complete kinetic framework established for two or more ribozymes.
• Chemical modification investigation of important residues well established.

Hairpin Ribozyme

• Size: ~50 nucleotides.
• Requires the target sequence GUC immediately 3' of the cleavage site.
• Binds 4-6 nucleotides at the 5'-side of the cleavage site and a variable number to the 3'-side of the cleavage
site.
• Reaction mechanism: attack by 2'-OH 5' to the scissile bond to generate cleavage products with 2',3'-cyclic
phosphate and 5'-OH ends.
• 3 known members of this class. Found in three plant pathogens (satellite RNAs of the tobacco ringspot virus,
arabis mosaic virus and chicory yellow mottle virus) which use RNA as the infectious agent.
• Essential structural features largely defined.
• Ligation activity (in addition to cleavage activity) makes ribozyme amenable to engineering through in vitro
selection.
• Complete kinetic framework established for one ribozyme.
• Chemical modification investigation of important residues begun.

Hepatitis Delta Virus (HDV) Ribozyme

• Size: ~60 nucleotides.
• Trans cleavage of target RNAs demonstrated.
• Binding sites and structural requirements not fully determined, although no sequences 5' of cleavage site are
required. Folded ribozyme contains a pseudoknot structure.
• Reaction mechanism: attack by 2'-OH 5' to the scissile bond to generate cleavage products with 2',3'-cyclic
phosphate and 5'-OH ends.
• Only 2 known members of this class. Found in human HDV.
• Circular form of HDV is active and shows increased nuclease stability.

Table II: 0.2 µmol RNA Synthesis Cycle



Reagent               Equivalents   Amount (µL)   Wait time* (sec)
Phosphoramidites      15            31            465
SET                   38.7          31            465
Acetic anhydride      655           124           5
N-methyl-imidazole    1245          124           5
TCA                   700           732           10
Iodine                20.6          244           15

* Wait time does not include contact time during delivery.

Table V: Human telomerase reverse transcriptase (TERT) G-Cleaver Ribozyme and
Target Sequence



nt. Position   Substrate Sequence   Seq ID No.   Ribozyme Sequence                        Seq ID No.
16             GCUGCGUCCU G CUGCG   1454         CGCAG UGAUGGCAUGCACUAUGCGCG AGGACGCAGC   4233
19             GCGUCCUGCU G CGCAC   1455         GUGCG UGAUGGCAUGCACUAUGCGCG AGCAGGACGC   4234
21             GUCCUGCUGC G CACGU   1456         ACGUG UGAUGGCAUGCACUAUGCGCG GCAGCAGGAC   4235
53             GGCCACCCCC G CGAUG   1457         CAUCG UGAUGGCAUGCACUAUGCGCG GGGGGUGGCC   4236
55             CCACCCCCGC G AUGCC   1458         GGCAU UGAUGGCAUGCACUAUGCGCG GCGGGGGUGG   4237
58             CCCCCGCGAU G CCGCG   1459         CGCGG UGAUGGCAUGCACUAUGCGCG AUCGCGGGGG   4238
61             CCGCGAUGCC G CGCGC   1460         GCGCG UGAUGGCAUGCACUAUGCGCG GGCAUCGCGG   4239
63             GCGAUGCCGC G CGCUC   1461         GAGCG UGAUGGCAUGCACUAUGCGCG GCGGCAUCGC   4240
65             GAUGCCGCGC G CUCCC   1462         GGGAG UGAUGGCAUGCACUAUGCGCG GCGCGGCAUC   4241
72             CGCGCUCCCC G CUGCC   1463         GGCAG UGAUGGCAUGCACUAUGCGCG GGGGAGCGCG   4242
75             GCUCCCCGCU G CCGAG   1464         CUCGG UGAUGGCAUGCACUAUGCGCG AGCGGGGAGC   4243
78             CCCCGCUGCC G AGCCG   1465         CGGCU UGAUGGCAUGCACUAUGCGCG GGCAGCGGGG   4244
85             GCCGAGCCGU G CGCUC   1466         GAGCG UGAUGGCAUGCACUAUGCGCG ACGGCUCGGC   4245
87             CGAGCCGUGC G CUCCC   1467         GGGAG UGAUGGCAUGCACUAUGCGCG GCACGGCUCG   4246
94             UGCGCUCCCU G CUGCG   1468         CGCAG UGAUGGCAUGCACUAUGCGCG AGGGAGCGCA   4247
97             GCUCCCUGCU G CGCAG   1469         CUGCG UGAUGGCAUGCACUAUGCGCG AGCAGGGAGC   4248
99             UCCCUGCUGC G CAGCC   1470         GGCUG UGAUGGCAUGCACUAUGCGCG GCAGCAGGGA   4249
111            AGCCACUACC G CGAGG   1471         CCUCG UGAUGGCAUGCACUAUGCGCG GGUAGUGGCU   4250
113            CCACUACCGC G AGGUG   1472         CACCU UGAUGGCAUGCACUAUGCGCG GCGGUAGUGG   4251
118            ACCGCGAGGU G CUGCC   1473         GGCAG UGAUGGCAUGCACUAUGCGCG ACCUCGCGGU   4252
121            GCGAGGUGCU G CCGCU   1474         AGCGG UGAUGGCAUGCACUAUGCGCG AGCACCUCGC   4253
124            AGGUGCUGCC G CUGGC   1475         GCCAG UGAUGGCAUGCACUAUGCGCG GGCAGCACCU   4254
139            CCACGUUCGU G CGGCG   1476         CGCCG UGAUGGCAUGCACUAUGCGCG ACGAACGUGG   4255


144 


UUCGUGCGGC G CCUGG 


1477 


CCAGG UGAUGGCAUGCACUAUGCGCG GCCGCACGAA 


4256 


172 


GGCGGCUGGU G CAGCG 


1478 


CGCUG UGAUGGCAUGCACUAUGCGCG ACCAGCCGCC 


4257 


177 


CUGGUGCAGC G CGGGG 


1479 


CCCCG UGAUGGCAUGCACUAUGCGCG GCUGCACCAG 


4258 


198 


GCGGCUUUCC G CGCGC 


1480 


GCGCG UGAUGGCAUGCACUAUGCGCG GGAAAGCCGC 


4259 


200 


GGCUUUCCGC G CGCUG 


1481 


CAGCG UGAUGGCAUGCACUAUGCGCG GCGGAAAGCC 


4260 


202 


CUUUCCGCGC G CUGGU 


1482 


ACCAG UGAUGGCAUGCACUAUGCGCG GCGCGGAAAG 


4261 


216 


GUGGCCCAGU G CCUGG 


1483 


CCAGG UGAUGGCAUGCACUAUGCGCG ACUGGGCCAC 


4262 


223 


AGUGCCUGGU G UGCGU 


1484 


ACGCA UGAUGGCAUGCACUAUGCGCG ACCAGGCACU 


4263 


225 


UGCCUGGUGU G CGUGC 


1485 


GCACG UGAUGGCAUGCACUAUGCGCG ACACCAGGCA 


4264 


229 


UGGUGUGCGU G CCCUG 


1486 


CAGGG UGAUGGCAUGCACUAUGCGCG ACGCACACCA 


4265 


239 


GCCCUGGGAC G CACGG 


1487 


CCGUG UGAUGGCAUGCACUAUGCGCG GUCCCAGGGC 


4266 


247 


ACGCACGGCC G CCCCC 


1488 


GGGGG UGAUGGCAUGCACUAUGCGCG GGCCGUGCGU 


4267 


254 


GCCGCCCCCC G CCGCC 


1489 


GGCGG UGAUGGCAUGCACUAUGCGCG GGGGGGCGGC 


4268 


257 


GCCCCCCGCC G CCCCC 


1490 


GGGGG UGAUGGCAUGCACUAUGCGCG GGCGGGGGGC 


4269 


270 


CCCUCCUUCC G CCAGG 


1491 


CCUGG UGAUGGCAUGCACUAUGCGCG GGAAGGAGGG 


4270 


277 


UCCGCCAGGU G UCCUG 


1492 


CAGGA UGAUGGCAUGCACUAUGCGCG ACCUGGCGGA 


4271 


282 


CAGGUGUCCU G CCUGA 


1493 


UCAGG UGAUGGCAUGCACUAUGCGCG AGGACACCUG 


4272 


286 


UGUCCUGCCU G AAGGA 


1494 


UCCUU UGAUGGCAUGCACUAUGCGCG AGGCAGGACA 


4273 
         303  CUGGUGGCCC G AGUGC        1495  GCACU UGAUGGCAUGCACUAUGCGCG GGGCCACCAG        4274
         307  UGGCCCGAGU G CUGCA        1496  UGCAG UGAUGGCAUGCACUAUGCGCG ACUCGGGCCA        4275
         310  CCCGAGUGCU G CAGAG        1497  CUCUG UGAUGGCAUGCACUAUGCGCG AGCACUCGGG        4276
         319  UGCAGAGGCU G UGCGA        1498  UCGCA UGAUGGCAUGCACUAUGCGCG AGCCUCUGCA        4277
         321  CAGAGGCUGU G CGAGC        1499  GCUCG UGAUGGCAUGCACUAUGCGCG ACAGCCUCUG        4278
         323  GAGGCUGUGC G AGCGC        1500  GCGCU UGAUGGCAUGCACUAUGCGCG GCACAGCCUC        4279
         327  CUGUGCGAGC G CGGCG        1501  CGCCG UGAUGGCAUGCACUAUGCGCG GCUCGCACAG        4280
         332  CGAGCGCGGC G CGAAG        1502  CUUCG UGAUGGCAUGCACUAUGCGCG GCCGCGCUCG        4281
         334  AGCGCGGCGC G AAGAA        1503  UUCUU UGAUGGCAUGCACUAUGCGCG GCGCCGCGCU        4282
         343  CGAAGAACGU G CUGGC        1504  GCCAG UGAUGGCAUGCACUAUGCGCG ACGUUCUUCG        4283
         359  CUUCGGCUUC G CGCUG        1505  CAGCG UGAUGGCAUGCACUAUGCGCG GAAGCCGAAG        4284
         361  UCGGCUUCGC G CUGCU        1506  AGCAG UGAUGGCAUGCACUAUGCGCG GCGAAGCCGA        4285
         364  GCUUCGCGCU G CUGGA        1507  UCCAG UGAUGGCAUGCACUAUGCGCG AGCGCGAAGC        4286
         378  GACGGGGCCC G CGGGG        1508  CCCCG UGAUGGCAUGCACUAUGCGCG GGGCCCCGUC        4287
         392  GGGCCCCCCC G AGGCC        1509  GGCCU UGAUGGCAUGCACUAUGCGCG GGGGGGGCCC        4288
         412  CCACCAGCGU G CGCAG        1510  CUGCG UGAUGGCAUGCACUAUGCGCG ACGCUGGUGG        4289
         414  ACCAGCGUGC G CAGCU        1511  AGCUG UGAUGGCAUGCACUAUGCGCG GCACGCUGGU        4290
         424  GCAGCUACCU G CCCAA        1512  UUGGG UGAUGGCAUGCACUAUGCGCG AGGUAGCUGC        4291
         436  CCAACACGGU G ACCGA        1513  UCGGU UGAUGGCAUGCACUAUGCGCG ACCGUGUUGG        4292
         440  CACGGUGACC G ACGCA        1514  UGCGU UGAUGGCAUGCACUAUGCGCG GGUCACCGUG        4293
         443  GGUGACCGAC G CACUG        1515  CAGUG UGAUGGCAUGCACUAUGCGCG GUCGGUCACC        4294
         448  CCGACGCACU G CGGGG        1516  CCCCG UGAUGGCAUGCACUAUGCGCG AGUGCGUCGG        4295
         472  CGUGGGGGCU G CUGCU        1517  AGCAG UGAUGGCAUGCACUAUGCGCG AGCCCCCACG        4296
         475  GGGGGCUGCU G CUGCG        1518  CGCAG UGAUGGCAUGCACUAUGCGCG AGCAGCCCCC        4297
         478  GGCUGCUGCU G CGCCG        1519  CGGCG UGAUGGCAUGCACUAUGCGCG AGCAGCAGCC        4298
         480  CUGCUGCUGC G CCGCG        1520  CGCGG UGAUGGCAUGCACUAUGCGCG GCAGCAGCAG        4299
         483  CUGCUGCGCC G CGUGG        1521  CCACG UGAUGGCAUGCACUAUGCGCG GGCGCAGCAG        4300
         491  CCGCGUGGGC G ACGAC        1522  GUCGU UGAUGGCAUGCACUAUGCGCG GCCCACGCGG        4301
         494  CGUGGGCGAC G ACGUG        1523  CACGU UGAUGGCAUGCACUAUGCGCG GUCGCCCACG        4302
         499  GCGACGACGU G CUGGU        1524  ACCAG UGAUGGCAUGCACUAUGCGCG ACGUCGUCGC        4303
         511  UGGUUCACCU G CUGGC        1525  GCCAG UGAUGGCAUGCACUAUGCGCG AGGUGAACCA        4304
         519  CUGCUGGCAC G CUGCG        1526  CGCAG UGAUGGCAUGCACUAUGCGCG GUGCCAGCAG        4305
         522  CUGGCACGCU G CGCGC        1527  GCGCG UGAUGGCAUGCACUAUGCGCG AGCGUGCCAG        4306
         524  GGCACGCUGC G CGCUC        1528  GAGCG UGAUGGCAUGCACUAUGCGCG GCAGCGUGCC        4307
         526  CACGCUGCGC G CUCUU        1529  AAGAG UGAUGGCAUGCACUAUGCGCG GCGCAGCGUG        4308
         533  CGCGCUCUUU G UGCUG        1530  CAGCA UGAUGGCAUGCACUAUGCGCG AAAGAGCGCG        4309
         535  CGCUCUUUGU G CUGGU        1531  ACCAG UGAUGGCAUGCACUAUGCGCG ACAAAGAGCG        4310
         552  GCUCCCAGCU G CGCCU        1532  AGGCG UGAUGGCAUGCACUAUGCGCG AGCUGGGAGC        4311
         554  UCCCAGCUGC G CCUAC        1533  GUAGG UGAUGGCAUGCACUAUGCGCG GCAGCUGGGA        4312
         565  CCUACCAGGU G UGCGG        1534  CCGCA UGAUGGCAUGCACUAUGCGCG ACCUGGUAGG        4313
         567  UACCAGGUGU G CGGGC        1535  GCCCG UGAUGGCAUGCACUAUGCGCG ACACCUGGUA        4314
         574  UGUGCGGGCC G CCGCU        1536  AGCGG UGAUGGCAUGCACUAUGCGCG GGCCCGCACA        4315
         577  GCGGGCCGCC G CUGUA        1537  UACAG UGAUGGCAUGCACUAUGCGCG GGCGGCCCGC        4316
         580  GGCCGCCGCU G UACCA        1538  UGGUA UGAUGGCAUGCACUAUGCGCG AGCGGCGGCC        4317
         593  CCAGCUCGGC G CUGCC        1539  GGCAG UGAUGGCAUGCACUAUGCGCG GCCGAGCUGG        4318
         596  GCUCGGCGCU G CCACU        1540  AGUGG UGAUGGCAUGCACUAUGCGCG AGCGCCGAGC        4319
         616  CCCGGCCCCC G CCACA        1541  UGUGG UGAUGGCAUGCACUAUGCGCG GGGGGCCGGG        4320
         623  CCCGCCACAC G CUAGU        1542  ACUAG UGAUGGCAUGCACUAUGCGCG GUGUGGCGGG        4321
         636  AGUGGACCCC G AAGGC        1543  GCCUU UGAUGGCAUGCACUAUGCGCG GGGGUCCACU        4322
         651  CGUCUGGGAU G CGAAC        1544  GUUCG UGAUGGCAUGCACUAUGCGCG AUCCCAGACG        4323
         653  UCUGGGAUGC G AACGG        1545  CCGUU UGAUGGCAUGCACUAUGCGCG GCAUCCCAGA        4324
         703  CCCUGGGCCU G CCAGC        1546  GCUGG UGAUGGCAUGCACUAUGCGCG AGGCCCAGGG        4325
         716  AGCCCCGGGU G CGAGG        1547  CCUCG UGAUGGCAUGCACUAUGCGCG ACCCGGGGCU        4326
         718  CCCCGGGUGC G AGGAG        1548  CUCCU UGAUGGCAUGCACUAUGCGCG GCACCCGGGG        4327
         726  GCGAGGAGGC G CGGGG        1549  CCCCG UGAUGGCAUGCACUAUGCGCG GCCUCCUCGC        4328
         737  CGGGGGCAGU G CCAGC        1550  GCUGG UGAUGGCAUGCACUAUGCGCG ACUGCCCCCG        4329
         744  AGUGCCAGCC G AAGUC        1551  GACUU UGAUGGCAUGCACUAUGCGCG GGCUGGCACU        4330
         751  GCCGAAGUCU G CCGUU        1552  AACGG UGAUGGCAUGCACUAUGCGCG AGACUUCGGC        4331
         757  GUCUGCCGUU G CCCAA        1553  UUGGG UGAUGGCAUGCACUAUGCGCG AACGGCAGAC        4332
         779  CAGGCGUGGC G CUGCC        1554  GGCAG UGAUGGCAUGCACUAUGCGCG GCCACGCCUG        4333
         782  GCGUGGCGCU G CCCCU        1555  AGGGG UGAUGGCAUGCACUAUGCGCG AGCGCCACGC        4334
         788  CGCUGCCCCU G AGCCG        1556  CGGCU UGAUGGCAUGCACUAUGCGCG AGGGGCAGCG        4335
         802  CGGAGCGGAC G CCCGU        1557  ACGGG UGAUGGCAUGCACUAUGCGCG GUCCGCUCCG        4336
         841  CGGGCAGGAC G CGUGG        1558  CCACG UGAUGGCAUGCACUAUGCGCG GUCCUGCCCG        4337
         850  CGCGUGGACC G AGUGA        1559  UCACU UGAUGGCAUGCACUAUGCGCG GGUCCACGCG        4338
         854  UGGACCGAGU G ACCGU        1560  ACGGU UGAUGGCAUGCACUAUGCGCG ACUCGGUCCA        4339
         867  CGUGGUUUCU G UGUGG        1561  CCACA UGAUGGCAUGCACUAUGCGCG AGAAACCACG        4340
         869  UGGUUUCUGU G UGGUG        1562  CACCA UGAUGGCAUGCACUAUGCGCG ACAGAAACCA        4341
         874  UCUGUGUGGU G UCACC        1563  GGUGA UGAUGGCAUGCACUAUGCGCG ACCACACAGA        4342
         881  GGUGUCACCU G CCAGA        1564  UCUGG UGAUGGCAUGCACUAUGCGCG AGGUGACACC        4343
         890  UGCCAGACCC G CCGAA        1565  UUCGG UGAUGGCAUGCACUAUGCGCG GGGUCUGGCA        4344
         893  CAGACCCGCC G AAGAA        1566  UUCUU UGAUGGCAUGCACUAUGCGCG GGCGGGUCUG        4345
         917  UUUGGAGGGU G CGCUC        1567  GAGCG UGAUGGCAUGCACUAUGCGCG ACCCUCCAAA        4346
         919  UGGAGGGUGC G CUCUC        1568  GAGAG UGAUGGCAUGCACUAUGCGCG GCACCCUCCA        4347
         931  UCUCUGGCAC G CGCCA        1569  UGGCG UGAUGGCAUGCACUAUGCGCG GUGCCAGAGA        4348
         933  UCUGGCACGC G CCACU        1570  AGUGG UGAUGGCAUGCACUAUGCGCG GCGUGCCAGA        4349
         957  UCCGUGGGCC G CCAGC        1571  GCUGG UGAUGGCAUGCACUAUGCGCG GGCCCACGGA        4350
         968  CCAGCACCAC G CGGGC        1572  GCCCG UGAUGGCAUGCACUAUGCGCG GUGGUGCUGG        4351
         988  CAUCCACAUC G CGGCC        1573  GGCCG UGAUGGCAUGCACUAUGCGCG GAUGUGGAUG        4352
        1012  CCUGGGACAC G CCUUG        1574  CAAGG UGAUGGCAUGCACUAUGCGCG GUGUCCCAGG        4353
        1017  GACACGCCUU G UCCCC        1575  GGGGA UGAUGGCAUGCACUAUGCGCG AAGGCGUGUC        4354
        1027  GUCCCCCGGU G UACGC        1576  GCGUA UGAUGGCAUGCACUAUGCGCG ACCGGGGGAC        4355
        1031  CCCGGUGUAC G CCGAG        1577  CUCGG UGAUGGCAUGCACUAUGCGCG GUACACCGGG        4356
        1034  GGUGUACGCC G AGACC        1578  GGUCU UGAUGGCAUGCACUAUGCGCG GGCGUACACC        4357
        1064  CUCCUCAGGC G ACAAG        1579  CUUGU UGAUGGCAUGCACUAUGCGCG GCCUGAGGAG        4358
        1078  AGGAGCAGCU G CGGCC        1580  GGCCG UGAUGGCAUGCACUAUGCGCG AGCUGCUCCU        4359
        1105  UCAGCUCUCU G AGGCC        1581  GGCCU UGAUGGCAUGCACUAUGCGCG AGAGAGCUGA        4360
        1117  GGCCCAGCCU G ACUGG        1582  CCAGU UGAUGGCAUGCACUAUGCGCG AGGCUGGGCC        4361
        1124  CCUGACUGGC G CUCGG        1583  CCGAG UGAUGGCAUGCACUAUGCGCG GCCAGUCAGG        4362
        1171  GGCCCUGGAU G CCAGG        1584  CCUGG UGAUGGCAUGCACUAUGCGCG AUCCAGGGCC        4363
        1185  GGGACUCCCC G CAGGU        1585  ACCUG UGAUGGCAUGCACUAUGCGCG GGGGAGUCCC        4364
        1192  CCCGCAGGUU G CCCCG        1586  CGGGG UGAUGGCAUGCACUAUGCGCG AACCUGCGGG        4365
        1197  AGGUUGCCCC G CCUGC        1587  GCAGG UGAUGGCAUGCACUAUGCGCG GGGGCAACCU        4366
        1201  UGCCCCGCCU G CCCCA        1588  UGGGG UGAUGGCAUGCACUAUGCGCG AGGCGGGGCA        4367
        1209  CUGCCCCAGC G CUACU        1589  AGUAG UGAUGGCAUGCACUAUGCGCG GCUGGGGCAG        4368
        1222  ACUGGCAAAU G CGGCC        1590  GGCCG UGAUGGCAUGCACUAUGCGCG AUUUGCCAGU        4369
        1231  UGCGGCCCCU G UUUCU        1591  AGAAA UGAUGGCAUGCACUAUGCGCG AGGGGCCGCA        4370
        1243  UUCUGGAGCU G CUUGG        1592  CCAAG UGAUGGCAUGCACUAUGCGCG AGCUCCAGAA        4371
        1256  UGGGAACCAC G CGCAG        1593  CUGCG UGAUGGCAUGCACUAUGCGCG GUGGUUCCCA        4372
        1258  GGAACCACGC G CAGUG        1594  CACUG UGAUGGCAUGCACUAUGCGCG GCGUGGUUCC        4373
        1263  CACGCGCAGU G CCCCU        1595  AGGGG UGAUGGCAUGCACUAUGCGCG ACUGCGCGUG        4374
        1276  CCUACGGGGU G CUCCU        1596  AGGAG UGAUGGCAUGCACUAUGCGCG ACCCCGUAGG        4375
        1288  UCCUCAAGAC G CACUG        1597  CAGUG UGAUGGCAUGCACUAUGCGCG GUCUUGAGGA        4376
        1293  AAGACGCACU G CCCGC        1598  GCGGG UGAUGGCAUGCACUAUGCGCG AGUGCGUCUU        4377
        1297  CGCACUGCCC G CUGCG        1599  CGCAG UGAUGGCAUGCACUAUGCGCG GGGCAGUGCG        4378
        1300  ACUGCCCGCU G CGAGC        1600  GCUCG UGAUGGCAUGCACUAUGCGCG AGCGGGCAGU        4379
        1302  UGCCCGCUGC G AGCUG        1601  CAGCU UGAUGGCAUGCACUAUGCGCG GCAGCGGGCA        4380
        1307  GCUGCGAGCU G CGGUC        1602  GACCG UGAUGGCAUGCACUAUGCGCG AGCUCGCAGC        4381
        1328  AGCAGCCGGU G UCUGU        1603  ACAGA UGAUGGCAUGCACUAUGCGCG ACCGGCUGCU        4382
        1332  GCCGGUGUCU G UGCCC        1604  GGGCA UGAUGGCAUGCACUAUGCGCG AGACACCGGC        4383
        1334  CGGUGUCUGU G CCCGG        1605  CCGGG UGAUGGCAUGCACUAUGCGCG ACAGACACCG        4384
        1358  CCAGGGCUCU G UGGCG        1606  CGCCA UGAUGGCAUGCACUAUGCGCG AGAGCCCUGG        4385
        1370  GGCGGCCCCC G AGGAG        1607  CUCCU UGAUGGCAUGCACUAUGCGCG GGGGGCCGCC        4386
        1395  GACCCCCGUC G CCUGG        1608  CCAGG UGAUGGCAUGCACUAUGCGCG GACGGGGGUC        4387
        1402  GUCGCCUGGU G CAGCU        1609  AGCUG UGAUGGCAUGCACUAUGCGCG ACCAGGCGAC        4388
        1408  UGGUGCAGCU G CUCCG        1610  CGGAG UGAUGGCAUGCACUAUGCGCG AGCUGCACCA        4389
        1413  CAGCUGCUCC G CCAGC        1611  GCUGG UGAUGGCAUGCACUAUGCGCG GGAGCAGCUG        4390
        1438  CCUGGCAGGU G UACGG        1612  CCGUA UGAUGGCAUGCACUAUGCGCG ACCUGCCAGG        4391
        1450  ACGGCUUCGU G CGGGC        1613  GCCCG UGAUGGCAUGCACUAUGCGCG ACGAAGCCGU        4392
        1458  GUGCGGGCCU G CCUGC        1614  GCAGG UGAUGGCAUGCACUAUGCGCG AGGCCCGCAC        4393
        1462  GGGCCUGCCU G CGCCG        1615  CGGCG UGAUGGCAUGCACUAUGCGCG AGGCAGGCCC        4394
        1464  GCCUGCCUGC G CCGGC        1616  GCCGG UGAUGGCAUGCACUAUGCGCG GCAGGCAGGC        4395
        1474  GCCGGCUGGU G CCCCC        1617  GGGGG UGAUGGCAUGCACUAUGCGCG ACCAGCCGGC        4396
        1505  CAGGCACAAC G AACGC        1618  GCGUU UGAUGGCAUGCACUAUGCGCG GUUGUGCCUG        4397
        1509  CACAACGAAC G CCGCU        1619  AGCGG UGAUGGCAUGCACUAUGCGCG GUUCGUUGUG        4398
        1512  AACGAACGCC G CUUCC        1620  GGAAG UGAUGGCAUGCACUAUGCGCG GGCGUUCGUU        4399
        1556  GGGGAAGCAU G CCAAG        1621  CUUGG UGAUGGCAUGCACUAUGCGCG AUGCUUCCCC        4400
        1567  CCAAGCUCUC G CUGCA        1622  UGCAG UGAUGGCAUGCACUAUGCGCG GAGAGCUUGG        4401
        1570  AGCUCUCGCU G CAGGA        1623  UCCUG UGAUGGCAUGCACUAUGCGCG AGCGAGAGCU        4402
        1579  UGCAGGAGCU G ACGUG        1624  CACGU UGAUGGCAUGCACUAUGCGCG AGCUCCUGCA        4403
        1591  CGUGGAAGAU G AGCGU        1625  ACGCU UGAUGGCAUGCACUAUGCGCG AUCUUCCACG        4404
        1597  AGAUGAGCGU G CGGGA        1626  UCCCG UGAUGGCAUGCACUAUGCGCG ACGCUCAUCU        4405
        1605  GUGCGGGACU G CGCUU        1627  AAGCG UGAUGGCAUGCACUAUGCGCG AGUCCCGCAC        4406
        1607  GCGGGACUGC G CUUGG        1628  CCAAG UGAUGGCAUGCACUAUGCGCG GCAGUCCCGC        4407
        1615  GCGCUUGGCU G CGCAG        1629  CUGCG UGAUGGCAUGCACUAUGCGCG AGCCAAGCGC        4408
        1617  GCUUGGCUGC G CAGGA        1630  UCCUG UGAUGGCAUGCACUAUGCGCG GCAGCCAAGC        4409
        1638  GGGGUUGGCU G UGUUC        1631  GAACA UGAUGGCAUGCACUAUGCGCG AGCCAACCCC        4410
        1640  GGUUGGCUGU G UUCCG        1632  CGGAA UGAUGGCAUGCACUAUGCGCG ACAGCCAACC        4411
        1649  UGUUCCGGCC G CAGAG        1633  CUCUG UGAUGGCAUGCACUAUGCGCG GGCCGGAACA        4412
        1663  AGCACCGUCU G CGUGA        1634  UCACG UGAUGGCAUGCACUAUGCGCG AGACGGUGCU        4413
        1667  CCGUCUGCGU G AGGAG        1635  CUCCU UGAUGGCAUGCACUAUGCGCG ACGCAGACGG        4414
        1690  CCAAGUUCCU G CACUG        1636  CAGUG UGAUGGCAUGCACUAUGCGCG AGGAACUUGG        4415
        1699  UGCACUGGCU G AUGAG        1637  CUCAU UGAUGGCAUGCACUAUGCGCG AGCCAGUGCA        4416
        1702  ACUGGCUGAU G AGUGU        1638  ACACU UGAUGGCAUGCACUAUGCGCG AUCAGCCAGU        4417
        1705  GCUGAUGAGU G UGUAC        1639  GUACA UGAUGGCAUGCACUAUGCGCG ACUCAUCAGC        4418
        1708  UGAUGAGUGU G UACGU        1640  ACGUA UGAUGGCAUGCACUAUGCGCG ACACUCAUCA        4419
        1718  GUACGUCGUC G AGCUG        1641  CAGCU UGAUGGCAUGCACUAUGCGCG GACGACGUAC        4420
        1723  UCGUCGAGCU G CUCAG        1642  CUGAG UGAUGGCAUGCACUAUGCGCG AGCUCGACGA        4421
        1742  UUUCUUUUAU G UCACG        1643  CGUGA UGAUGGCAUGCACUAUGCGCG AUAAAAGAAA        4422
        1793  CCGGAAGAGU G UCUGG        1644  CCAGA UGAUGGCAUGCACUAUGCGCG ACUCUUCCGG        4423
        1807  GGAGCAAGUU G CAAAG        1645  CUUUG UGAUGGCAUGCACUAUGCGCG AACUUGCUCC        4424
        1834  GACAGCACUU G AAGAG        1646  CUCUU UGAUGGCAUGCACUAUGCGCG AAGUGCUGUC        4425
        1843  UGAAGAGGGU G CAGCU        1647  AGCUG UGAUGGCAUGCACUAUGCGCG ACCCUCUUCA        4426
        1849  GGGUGCAGCU G CGGGA        1648  UCCCG UGAUGGCAUGCACUAUGCGCG AGCUGCACCC        4427
        1858  UGCGGGAGCU G UCGGA        1649  UCCGA UGAUGGCAUGCACUAUGCGCG AGCUCCCGCA        4428
        1898  AGCCAGGCCC G CCCUG        1650  CAGGG UGAUGGCAUGCACUAUGCGCG GGGCCUGGCU        4429
        1903  GGCCCGCCCU G CUGAC        1651  GUCAG UGAUGGCAUGCACUAUGCGCG AGGGCGGGCC        4430
        1906  CCGCCCUGCU G ACGUC        1652  GACGU UGAUGGCAUGCACUAUGCGCG AGCAGGGCGG        4431
        1920  UCCAGACUCC G CUUCA        1653  UGAAG UGAUGGCAUGCACUAUGCGCG GGAGUCUGGA        4432
        1937  CCCCAAGCCU G ACGGG        1654  CCCGU UGAUGGCAUGCACUAUGCGCG AGGCUUGGGG        4433
        1945  CUGACGGGCU G CGGCC        1655  GGCCG UGAUGGCAUGCACUAUGCGCG AGCCCGUCAG        4434
        1951  GGCUGCGGCC G AUUGU        1656  ACAAU UGAUGGCAUGCACUAUGCGCG GGCCGCAGCC        4435
        1955  GCGGCCGAUU G UGAAC        1657  GUUCA UGAUGGCAUGCACUAUGCGCG AAUCGGCCGC        4436
        1957  GGCCGAUUGU G AACAU        1658  AUGUU UGAUGGCAUGCACUAUGCGCG ACAAUCGGCC        4437
        1992  AGAACGUUCC G CAGAG        1659  CUCUG UGAUGGCAUGCACUAUGCGCG GGAACGUUCU        4438
        2009  AAAGAGGGCC G AGCGU        1660  ACGCU UGAUGGCAUGCACUAUGCGCG GGCCCUCUUU        4439
        2023  GUCUCACCUC G AGGGU        1661  ACCCU UGAUGGCAUGCACUAUGCGCG GAGGUGAGAC        4440
        2029  CCUCGAGGGU G AAGGC        1662  GCCUU UGAUGGCAUGCACUAUGCGCG ACCCUCGAGG        4441
        2038  UGAAGGCACU G UUCAG        1663  CUGAA UGAUGGCAUGCACUAUGCGCG AGUGCCUUCA        4442
        2047  UGUUCAGCGU G CUCAA        1664  UUGAG UGAUGGCAUGCACUAUGCGCG ACGCUGAACA        4443
        2057  GCUCAACUAC G AGCGG        1665  CCGCU UGAUGGCAUGCACUAUGCGCG GUAGUUGAGC        4444
        2065  ACGAGCGGGC G CGGCG        1666  CGCCG UGAUGGCAUGCACUAUGCGCG GCCCGCUCGU        4445
        2070  CGGGCGCGGC G CCCCG        1667  CGGGG UGAUGGCAUGCACUAUGCGCG GCCGCGCCCG        4446
        2087  CCUCCUGGGC G CCUCU        1668  AGAGG UGAUGGCAUGCACUAUGCGCG GCCCAGGAGG        4447
        2093  GGGCGCCUCU G UGCUG        1669  CAGCA UGAUGGCAUGCACUAUGCGCG AGAGGCGCCC        4448
        2095  GCGCCUCUGU G CUGGG        1670  CCCAG UGAUGGCAUGCACUAUGCGCG ACAGAGGCGC        4449
        2108  GGGCCUGGAC G AUAUC        1671  GAUAU UGAUGGCAUGCACUAUGCGCG GUCCAGGCCC        4450
        2127  AGGGCCUGGC G CACCU        1672  AGGUG UGAUGGCAUGCACUAUGCGCG GCCAGGCCCU        4451
        2137  GCACCUUCGU G CUGCG        1673  CGCAG UGAUGGCAUGCACUAUGCGCG ACGAAGGUGC        4452
        2140  CCUUCGUGCU G CGUGU        1674  ACACG UGAUGGCAUGCACUAUGCGCG AGCACGAAGG        4453
        2144  CGUGCUGCGU G UGCGG        1675  CCGCA UGAUGGCAUGCACUAUGCGCG ACGCAGCACG        4454
        2146  UGCUGCGUGU G CGGGC        1676  GCCCG UGAUGGCAUGCACUAUGCGCG ACACGCAGCA        4455
        2161  CCCAGGACCC G CCGCC        1677  GGCGG UGAUGGCAUGCACUAUGCGCG GGGUCCUGGG        4456
        2164  AGGACCCGCC G CCUGA        1678  UCAGG UGAUGGCAUGCACUAUGCGCG GGCGGGUCCU        4457
        2168  CCCGCCGCCU G AGCUG        1679  CAGCU UGAUGGCAUGCACUAUGCGCG AGGCGGCGGG        4458
        2173  CGCCUGAGCU G UACUU        1680  AAGUA UGAUGGCAUGCACUAUGCGCG AGCUCAGGCG        4459
        2180  GCUGUACUUU G UCAAG        1681  CUUGA UGAUGGCAUGCACUAUGCGCG AAAGUACAGC        4460
        2192  CAAGGUGGAU G UGACG        1682  CGUCA UGAUGGCAUGCACUAUGCGCG AUCCACCUUG        4461
        2194  AGGUGGAUGU G ACGGG        1683  CCCGU UGAUGGCAUGCACUAUGCGCG ACAUCCACCU        4462
        2201  UGUGACGGGC G CGUAC        1684  GUACG UGAUGGCAUGCACUAUGCGCG GCCCGUCACA        4463
        2207  GGGCGCGUAC G ACACC        1685  GGUGU UGAUGGCAUGCACUAUGCGCG GUACGCGCCC        4464
        2243  GGAGGUCAUC G CCAGC        1686  GCUGG UGAUGGCAUGCACUAUGCGCG GAUGACCUCC        4465
        2274  AACACGUACU G CGUGC        1687  GCACG UGAUGGCAUGCACUAUGCGCG AGUACGUGUU        4466
        2278  CGUACUGCGU G CGUCG        1688  CGACG UGAUGGCAUGCACUAUGCGCG ACGCAGUACG        4467
        2288  GCGUCGGUAU G CCGUG        1689  CACGG UGAUGGCAUGCACUAUGCGCG AUACCGACGC        4468
        2306  CCAGAAGGCC G CCCAU        1690  AUGGG UGAUGGCAUGCACUAUGCGCG GGCCUUCUGG        4469
        2322  GGGCACGUCC G CAAGG        1691  CCUUG UGAUGGCAUGCACUAUGCGCG GGACGUGCCC        4470
        2353  UCUCUACCUU G ACAGA        1692  UCUGU UGAUGGCAUGCACUAUGCGCG AAGGUAGAGA        4471
        2374  AGCCGUACAU G CGACA        1693  UGUCG UGAUGGCAUGCACUAUGCGCG AUGUACGGCU        4472
        2376  CCGUACAUGC G ACAGU        1694  ACUGU UGAUGGCAUGCACUAUGCGCG GCAUGUACGG        4473
        2395  UGGCUCACCU G CAGGA        1695  UCCUG UGAUGGCAUGCACUAUGCGCG AGGUGAGCCA        4474
        2410  AGACCAGCCC G CUGAG        1696  CUCAG UGAUGGCAUGCACUAUGCGCG GGGCUGGUCU        4475
        2413  CCAGCCCGCU G AGGGA        1697  UCCCU UGAUGGCAUGCACUAUGCGCG AGCGGGCUGG        4476
        2420  GCUGAGGGAU G CCGUC        1698  GACGG UGAUGGCAUGCACUAUGCGCG AUCCCUCAGC        4477
        2432  CGUCGUCAUC G AGCAG        1699  CUGCU UGAUGGCAUGCACUAUGCGCG GAUGACGACG        4478
        2449  GCUCCUCCCU G AAUGA        1700  UCAUU UGAUGGCAUGCACUAUGCGCG AGGGAGGAGC        4479
        2453  CUCCCUGAAU G AGGCC        1701  GGCCU UGAUGGCAUGCACUAUGCGCG AUUCAGGGAG        4480
        2474  UGGCCUCUUC G ACGUC        1702  GACGU UGAUGGCAUGCACUAUGCGCG GAAGAGGCCA        4481
        2487  GUCUUCCUAC G CUUCA        1703  UGAAG UGAUGGCAUGCACUAUGCGCG GUAGGAAGAC        4482
        2494  UACGCUUCAU G UGCCA        1704  UGGCA UGAUGGCAUGCACUAUGCGCG AUGAAGCGUA        4483
        2496  CGCUUCAUGU G CCACC        1705  GGUGG UGAUGGCAUGCACUAUGCGCG ACAUGAAGCG        4484
        2504  GUGCCACCAC G CCGUG        1706  CACGG UGAUGGCAUGCACUAUGCGCG GUGGUGGCAC        4485
        2509  ACCACGCCGU G CGCAU        1707  AUGCG UGAUGGCAUGCACUAUGCGCG ACGGCGUGGU        4486
        2511  CACGCCGUGC G CAUCA        1708  UGAUG UGAUGGCAUGCACUAUGCGCG GCACGGCGUG        4487
        2538  UACGUCCAGU G CCAGG        1709  CCUGG UGAUGGCAUGCACUAUGCGCG ACUGGACGUA        4488
        2551  AGGGGAUCCC G CAGGG        1710  CCCUG UGAUGGCAUGCACUAUGCGCG GGGAUCCCCU        4489
        2572  UCCUCUCCAC G CUGCU        1711  AGCAG UGAUGGCAUGCACUAUGCGCG GUGGAGAGGA        4490
        2575  UCUCCACGCU G CUCUG        1712  CAGAG UGAUGGCAUGCACUAUGCGCG AGCGUGGAGA        4491
        2580  ACGCUGCUCU G CAGCC        1713  GGCUG UGAUGGCAUGCACUAUGCGCG AGAGCAGCGU        4492
        2587  UCUGCAGCCU G UGCUA        1714  UAGCA UGAUGGCAUGCACUAUGCGCG AGGCUGCAGA        4493
        2589  UGCAGCCUGU G CUACG        1715  CGUAG UGAUGGCAUGCACUAUGCGCG ACAGGCUGCA        4494
        2597  GUGCUACGGC G ACAUG        1716  CAUGU UGAUGGCAUGCACUAUGCGCG GCCGUAGCAC        4495
        2614  AGAACAAGCU G UUUGC        1717  GCAAA UGAUGGCAUGCACUAUGCGCG AGCUUGUUCU        4496
        2618  CAAGCUGUUU G CGGGG        1718  CCCCG UGAUGGCAUGCACUAUGCGCG AAACAGCUUG        4497
        2641  GGGACGGGCU G CUCCU        1719  AGGAG UGAUGGCAUGCACUAUGCGCG AGCCCGUCCC        4498
        2647  GGCUGCUCCU G CGUUU        1720  AAACG UGAUGGCAUGCACUAUGCGCG AGGAGCAGCC        4499
        2660  UUUGGUGGAU G AUUUC        1721  GAAAU UGAUGGCAUGCACUAUGCGCG AUCCACCAAA        4500
        2668  AUGAUUUCUU G UUGGU        1722  ACCAA UGAUGGCAUGCACUAUGCGCG AAGAAAUCAU        4501
        2674  UCUUGUUGGU G ACACC        1723  GGUGU UGAUGGCAUGCACUAUGCGCG ACCAACAAGA        4502
        2693  CCUCACCCAC G CGAAA        1724  UUUCG UGAUGGCAUGCACUAUGCGCG GUGGGUGAGG        4503
        2695  UCACCCACGC G AAAAC        1725  GUUUU UGAUGGCAUGCACUAUGCGCG GCGUGGGUGA        4504
        2721  ACCCUGGUCC G AGGUG        1726  CACCU UGAUGGCAUGCACUAUGCGCG GGACCAGGGU        4505
        2726  GGUCCGAGGU G UCCCU        1727  AGGGA UGAUGGCAUGCACUAUGCGCG ACCUCGGACC        4506
        2732  AGGUGUCCCU G AGUAU        1728  AUACU UGAUGGCAUGCACUAUGCGCG AGGGACACCU        4507
        2742  GAGUAUGGCU G CGUGG        1729  CCACG UGAUGGCAUGCACUAUGCGCG AGCCAUACUC        4508
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2749 


GCUGCGUGGU G AACUU 


1 T ^ A 

1730 


AAOTTTT TTO ATT/?!OOaTTOOAOTT7VTTOOOOO R/^i^aOrT'ar'O 

AAGUU UGAUGGCAUGV-AL.UAUGCGL.G AuvJAuvjL.AvjL. 


43U9 


1 T C C 

1 /bb 


UbbUCjjAACUU G CCjGAA 


1/31 


TTTTOOO TTO ATTOOO ATTOOA OTTATTOOOOO AAOTTTTOAOOA 

UUCCG UGAuGGGAUGLAGUAUGGGGG AAGUUGAGLA 




2770 


AGACAGUGGU G AACUU 


1 A 

1732 


AAOTTTT TTO ATTOOOATTOOA OTTATTOOOOO A OO A OTTOT TOT T 

AAGUU UGAUGGGAUGLAGUAUGGGLG ALGAL UGUGU 


4bll 


2780 


GAACUUCCCU G UAGAA 


1733 


TTTTOTTA T TO A T TOO O A T TOO A OT T A T TO OO OO AOOOAAOTTTTO 

UUCUA UGAUGGCAUGCACUAUGCGCG AGGGAAGUUC 


4512 


2789 


UGUAGAAGAC G AGGCC 


1734 


^^/^ /^T T T T^^ TV T T/^ TV T T^l TV ^IT TTV T T^l /^/^ /-< ^ITT^IT T^IT T TV TV 

GGCCU UGAUGGCAUGCACUAUGCGCG GUCUUCUACA 


4513 


2813 


CACGGCUUUU G UUCAG 


1735 


TV TV T T/*^ TV T T/^/^ TV T T/^ TV /^T T TV T T/*^ A^/^ TV TV TV TV /^/^/^T T/^ 

CUGAA UGAUGGCAUGCACUAUGCGCG AAAAGCCGUG 


4514 


2821 


UUGUUCAGAU G CCGGC 


1736 


^ T T/*l TV T T/*t TV T T/^ TV /*^T T TV T T/^ f^/^ TV T T/^t T/^ TV TV TV TV 

GCCGG UGAUGGCAUGCACUAUGCGCG AUCUGAACAA 


4515 




UUCCCCUGGU G CGGCC 


1737 


rM~\r\f^f* TTO ATTOOO ATTOOA OTTATTOOOOO AOOAOOOOAA 

GGCCG UGAUGGCAUGCACUAUGCGCG ACCAGGGGAA 


4516 


2854 


GGUGCGGCCU G CUGCU 


1738 


AOOAO TTO ATTOOO ATTOOA OTTATTOOOOO K f^r*r*f^r*r^1\. r*f^ 

AGCAG UGAUGGCAUGCACUAUGCGCG AGGCCGCACC 


4517 


2 857 


GCGGCCUGCU G CUGGA 


173 9 


UCCAG UGAUGGCAUGCACUAUGCGCG AGCAGGCCGC 


4518 


TOOT 


CCCUGGAGGU G CAGAG 


1740 


CUCUG UGAUGGCAUGCACUAUGCGCG ACCUCCAGGG 


>l C 1 A 

4519 


O O O D 


GGUGCAGAGC G ACUAC 


1741 


GUAGU UGAUGGCAUGCACUAUGCGCG GCUCUGCACC 


4520 




CUCCAGCUAU G CCCGG 


1742 


CCGGG UGAUGGCAUGCACUAUGCGCG AUAGCUGGAG 


4521 




ACCUUCAACC G CGGCU 


1743 


AGCCG UGAUGGCAUGCACUAUGCGCG GGUUGAAGGU 


4522 


2965 


GGAGGAACAU G CGUCG 


1744 


CGACG UGAUGGCAUGCACUAUGCGCG AUGUUCCUCC 


4523 


2970 


AACAUGCGUC G CAAAC 


1745 


GUUUG UGAUGGCAUGCACUAUGCGCG GACGCAUGUU 


4524 


2989 


UUGGGGUCUU G CGGCU 


1746 


AGCCG UGAUGGCAUGCACUAUGCGCG AAGACCCCAA 


4525 


2995 


UCUUGCGGCU G AAGUG 


1747 


CACUU UGAUGGCAUGCACUAUGCGCG AGCCGCAAGA 


4526 


3000 


CGGCUGAAGU G UCACA 


1748 


UGUGA UGAUGGCAUGCACUAUGCGCG ACUUCAGCCG 


4527 


3010 


GUCACAGCCU G UUUCU 


1749 


AGAAA UGAUGGCAUGCACUAUGCGCG AGGCUGUGAC 


4528 




UUCUGGAUUU G CAGGU 


1750 


ACCUG UGAUGGCAUGCACUAUGCGCG AAAUCCAGAA 


4529 


"3 A O 


AUUUGCAGGU G AACAG 


1751 


CUGUU UGAUGGCAUGCACUAUGCGCG ACCUGCAAAU 


4530 


3046 


UCCAGACGGU G UGCAC 


1752 


GUGCA UGAUGGCAUGCACUAUGCGCG ACCGUCUGGA 


4531 


3048 


CAGACGGUGU G CACCA 


1753 


UGGUG UGAUGGCAUGCACUAUGCGCG ACACCGUCUG 


4532 


3073 


AGAUCCUCCU G CUGCA 


1754 


UGCAG UGAUGGCAUGCACUAUGCGCG AGGAGGAUCU 


4533 


3076 


UCCUCCUGCU G CAGGC 


1755 


GCCUG UGAUGGCAUGCACUAUGCGCG AGCAGGAGGA 


4534 


3095 


CAGGUUUCAC G CAUGU 


1756 


ACAUG UGAUGGCAUGCACUAUGCGCG GUGAAACCUG 


4535 


3099 


UUUCACGCAU G UGUGC 


1757 


GCACA UGAUGGCAUGCACUAUGCGCG AUGCGUGAAA 


4536 


3101 


UCACGCAUGU G UGCUG 


1758 


CAGCA UGAUGGCAUGCACUAUGCGCG ACAUGCGUGA 


4537 


3103 


ACGCAUGUGU G CUGCA 


1759 


UGCAG UGAUGGCAUGCACUAUGCGCG ACACAUGCGU 


4538 


"3 T A ^ 

3106 


CAUGUGUGCU G CAGCU 


1760 


AGCUG UGAUGGCAUGCACUAUGCGCG AGCACACAUG 


4539 


3154 


CAUUUUUCCU G CGCGU 


1761 


ACGCG UGAUGGCAUGCACUAUGCGCG AGGAAAAAUG 


4540 


3156 


UUUUUCCUGC G CGUCA 


1762 


UGACG UGAUGGCAUGCACUAUGCGCG GCAGGAAAAA 


4541 


3167 


CGUCAUCUCU G ACACG 


1763 


CGUGU UGAUGGCAUGCACUAUGCGCG AGAGAUGACG 


4542 


3183 


GCCUCCCUCU G CUACU 


1764 


AGUAG UGAUGGCAUGCACUAUGCGCG AGAGGGAGGC 


4543 


3196 


ACUCCAUCCU G AAAGC 


1765 


GCUUU UGAUGGCAUGCACUAUGCGCG AGGAUGGAGU 


4544 


3209 


AGCCAAGAAC G CAGGG 


1766 


CCCUG UGAUGGCAUGCACUAUGCGCG GUUCUUGGCU 


4545 


3217 


ACGCAGGGAU G UCGCU 


1767 


AGCGA UGAUGGCAUGCACUAUGCGCG AUCCCUGCGU 


4546 


3220 


CAGGGAUGUC G CUGGG 


1768 


CCCAG UGAUGGCAUGCACUAUGCGCG GACAUCCCUG 


4547 


3236 


GGCCAAGGGC G CCGCC 


1769 


GGCGG UGAUGGCAUGCACUAUGCGCG GCCCUUGGCC 


4548 


3239 


CAAGGGCGCC G CCGGC 


1770 


GCCGG UGAUGGCAUGCACUAUGCGCG GGCGCCCUUG 


4549 


3250 




1771 


vj/ioLrVj ULjAULtVjLAUGGALUAUGLGLG AGAGGGLLGG 


4550 


3257 


UCUGCCCUCC G AGGCC 


1772 


GGCCU UGAUGGCAUGCACUAUGCGCG GGAGGGCAGA 


4551 


3265 


CCGAGGCCGU G CAGUG 


1773 


CACUG UGAUGGCAUGCACUAUGCGCG ACGGCCUCGG 


4552 


3274 


UGCAGUGGCU G UGCCA 


1774 


UGGCA UGAUGGCAUGCACUAUGCGCG AGCCACUGCA 


4553 


3276 


CAGUGGCUGU G CCACC 


1775 


GGUGG UGAUGGCAUGCACUAUGCGCG ACAGCCACUG 


4554 


3292 


AAGCAUUCCU G CUCAA 


1776 


UUGAG UGAUGGCAUGCACUAUGCGCG AGGAAUGCUU 


4555 
3301 


UGCUCAAGCU G ACUCG 


1777 


CGAGU UGAUGGCAUGCACUAUGCGCG AGCUUGAGCA 


4556 


3306 


AAGCUGACUC G ACACC 


1778 


GGUGU UGAUGGCAUGCACUAUGCGCG GAGUCAGCUU 


4557 


3314 


UCGACACCGU G UCACC 


1779 


GGUGA UGAUGGCAUGCACUAUGCGCG ACGGUGUCGA 


4558 


3325 


UCACCUACGU G CCACU 


1780 


AGUGG UGAUGGCAUGCACUAUGCGCG ACGUAGGUGA 


4559 


3358 


CAGCCCAGAC G CAGCU 


1781 


AGCUG UGAUGGCAUGCACUAUGCGCG GUCUGGGCUG 


4560 


3364 


AGACGCAGCU G AGUCG 


1782 


CGACU UGAUGGCAUGCACUAUGCGCG AGCUGCGUCU 


4561 


3385 


UCCCGGGGAC G ACGCU 


1783 


AGCGU UGAUGGCAUGCACUAUGCGCG GUCCCCGGGA 


4562 


3388 


CGGGGACGAC G CUGAC 


1784 


GUCAG UGAUGGCAUGCACUAUGCGCG GUCGUCCCCG 


4563 


3391 


GGACGACGCU G ACUGC 


1785 


GCAGU UGAUGGCAUGCACUAUGCGCG AGCGUCGUCC 


4564 


3395 


GACGCUGACU G CCCUG 


1786 


CAGGG UGAUGGCAUGCACUAUGCGCG AGUCAGCGUC 


4565 


3407 


CCUGGAGGCC G CAGCC 


1787 


GGCUG UGAUGGCAUGCACUAUGCGCG GGCCUCCAGG 


4566 


3424 


ACCCGGCACU G CCCUC 


1788 


GAGGG UGAUGGCAUGCACUAUGCGCG AGUGCCGGGU 


4567 


3453 


AUCCUGGACU G AUGGC 


1789 


GCCAU UGAUGGCAUGCACUAUGCGCG AGUCCAGGAU 


4568 


3464 


AUGGCCACCC G CCCAC 


1790 


GUGGG UGAUGGCAUGCACUAUGCGCG GGGUGGCCAU 


4569 


3479 


CAGCCAGGCC G AGAGC 


1791 


GCUCU UGAUGGCAUGCACUAUGCGCG GGCCUGGCUG 


4570 


3501 


CAGCAGCCCU G UCACG 


1792 


CGUGA UGAUGGCAUGCACUAUGCGCG AGGGCUGCUG 


4571 


3506 


GCCCUGUCAC G CCGGG 


1793 


CCCGG UGAUGGCAUGCACUAUGCGCG GUGACAGGGC 


4572 


3554 


ACCCAGGCCC G CACCG 


1794 


CGGUG UGAUGGCAUGCACUAUGCGCG GGGCCUGGGU 


4573 


3559 


GGCCCGCACC G CUGGG 


1795 


CCCAG UGAUGGCAUGCACUAUGCGCG GGUGCGGGCC 


4574 


3570 


CUGGGAGUCU G AGGCC 


1796 


GGCCU UGAUGGCAUGCACUAUGCGCG AGACUCCCAG 


4575 


3577 


UCUGAGGCCU G AGUGA 


1797 


UCACU UGAUGGCAUGCACUAUGCGCG AGGCCUCAGA 


4576 


3581 


AGGCCUGAGU G AGUGU 


1798 


ACACU UGAUGGCAUGCACUAUGCGCG ACUCAGGCCU 


4577 


3585 


CUGAGUGAGU G UUUGG 


1799 


CCAAA UGAUGGCAUGCACUAUGCGCG ACUCACUCAG 


4578 




GUGUUUGGCC G AGGCC 


1800 


GGCCU UGAUGGCAUGCACUAUGCGCG GGCCAAACAC 


4579 




GCCGAGGCCU G CAUGU 


1801 


ACAUG UGAUGGCAUGCACUAUGCGCG AGGCCUCGGC 


4580 


3604 


AGGCCUGCAU G UCCGG 


1802 


CCGGA UGAUGGCAUGCACUAUGCGCG AUGCAGGCCU 


4581 


3612 


AUGUCCGGCU G AAGGC 


1803 


GCCUU UGAUGGCAUGCACUAUGCGCG AGCCGGACAU 


4582 


3619 


GCUGAAGGCU G AGUGU 


1804 


ACACU UGAUGGCAUGCACUAUGCGCG AGCCUUCAGC 


4583 


3623 


AAGGCUGAGU G UCCGG 


1805 


CCGGA UGAUGGCAUGCACUAUGCGCG ACUCAGCCUU 


4584 


3631 


GUGUCCGGCU G AGGCC 


1806 


GGCCU UGAUGGCAUGCACUAUGCGCG AGCCGGACAC 


4585 


3638 


GCUGAGGCCU G AGCGA 


1807 


UCGCU UGAUGGCAUGCACUAUGCGCG AGGCCUCAGC 


4586 


3642 


AGGCCUGAGC G AGUGU 


1808 


ACACU UGAUGGCAUGCACUAUGCGCG GCUCAGGCCU 


4587 


3646 


CUGAGCGAGU G UCCAG 


1809 


CUGGA UGAUGGCAUGCACUAUGCGCG ACUCGCUCAG 


4588 


3661 


GCCAAGGGCU G AGUGU 


1810 


ACACU UGAUGGCAUGCACUAUGCGCG AGCCCUUGGC 


4589 


3665 


AGGGCUGAGU G UCCAG 


1811 


CUGGA UGAUGGCAUGCACUAUGCGCG ACUCAGCCCU 


4590 


3678 


CAGCACACCU G CCGUC 


1812 


GACGG UGAUGGCAUGCACUAUGCGCG AGGUGUGCUG 


4591 


3705 


ACAGGCUGGC G CUCGG 


1813 


CCGAG UGAUGGCAUGCACUAUGCGCG GCCAGCCUGU 


4592 


3789 


CCCCAGAUUC G CCAUU 


1814 


AAUGG UGAUGGCAUGCACUAUGCGCG GAAUCUGGGG 


4593 


3795 


AUUCGCCAUU G UUCAC 


1815 


GUGAA UGAUGGCAUGCACUAUGCGCG AAUGGCGAAU 


4594 


3806 


UUCACCCCUC G CCCUG 


1816 


CAGGG UGAUGGCAUGCACUAUGCGCG GAGGGGUGAA 


4595 


3811 


CCCUCGCCCU G CCCUC 


1817 


GAGGG UGAUGGCAUGCACUAUGCGCG AGGGCGAGGG 


4596 


3821 


GCCCUCCUUU G CCUUC 


1818 


GAAGG UGAUGGCAUGCACUAUGCGCG AAAGGAGGGC 


4597 


3854 


UGGAGACCCU G AGAAG 


1819 


CUUCU UGAUGGCAUGCACUAUGCGCG AGGGUCUCCA 


4598 


3888 


AAUUUGGAGU G ACCAA 


1820 


UUGGU UGAUGGCAUGCACUAUGCGCG ACUCCAAAUU 


4599 


3898 


GACCAAAGGU G UGCCC 


1821 


GGGCA UGAUGGCAUGCACUAUGCGCG ACCUUUGGUC 


4600 


3900 


CCAAAGGUGU G CCCUG 


1822 


CAGGG UGAUGGCAUGCACUAUGCGCG ACACCUUUGG 


4601 


3905 


GGUGUGCCCU G UACAC 


1823 


GUGUA UGAUGGCAUGCACUAUGCGCG AGGGCACACC 


4602 
3915 


GUACACAGGC G AGGAC 


1824 


GUCCU UGAUGGCAUGCACUAUGCGCG GCCUGUGUAC 


4603 


3924 


CGAGGACCCU G CACCU 


1825 


AGGUG UGAUGGCAUGCACUAUGCGCG AGGGUCCUCG 


4604 


3944 


GGGGGUCCCU G UGGGU 


1826 


ACCCA UGAUGGCAUGCACUAUGCGCG AGGGACCCCC 


4605 


3966 


GGGGGGAGGU G CUGUG 


1827 


CACAG UGAUGGCAUGCACUAUGCGCG ACCUCCCCCC 


4606 


3969 


GGGAGGUGCU G UGGGA 


1828 


UCCCA UGAUGGCAUGCACUAUGCGCG AGCACCUCCC 


4607 


3985 


GUAAAAUACU G AAUAU 


1829 


AUAUU UGAUGGCAUGCACUAUGCGCG AGUAUUUUAC 


4608 


3993 


CUGAAUAUAU G AGUUU 


1830 


AAACU UGAUGGCAUGCACUAUGCGCG AUAUAUUCAG 


4609 


4008 


UUUCAGUUUU G AAAAA 


1831 


UUUUU UGAUGGCAUGCACUAUGCGCG AAAACUGAAA 


4610 



Seq1 = TERT (Homo sapiens telomerase reverse transcriptase (TERT) mRNA, 4015 bp); Nakamura et al., Science 277 (5328), 
955-959 (1997) 

Input Sequence = TERT. Cut Site = YG/M or UG/U. 

Stem Length = 5/10. Core Sequence = UGAUG GCAUGCACUAUGC GCG 
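
By way of illustration only, the relationship between the substrate entries and the enzymatic nucleic acid entries of the preceding table can be sketched as follows. The sketch assumes, based on the rows shown above, that each enzymatic sequence is the reverse complement of the 5-nucleotide arm following the cut site, then the listed core sequence, then the reverse complement of the 10-nucleotide arm preceding the cut site. The function names and the space-separated input format are hypothetical conveniences and are not part of the disclosure.

```python
# Illustrative sketch only: derives a table entry from a substrate entry,
# assuming the arm/core arrangement observed in the rows of the table.

RNA_COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}

# Core sequence as listed in the table footer (spaces removed).
CORE = "UGAUGGCAUGCACUAUGCGCG"


def reverse_complement(seq, complement=RNA_COMPLEMENT):
    """Return the reverse complement of an RNA sequence."""
    return "".join(complement[base] for base in reversed(seq))


def enzymatic_sequence(substrate, core=CORE):
    """Build 'arm core arm' from a substrate written as '5-arm cut-base 3-arm'.

    The cut-site base itself is left unpaired, so it does not appear
    in either binding arm of the result.
    """
    five_arm, _cut_base, three_arm = substrate.split()
    return " ".join(
        [reverse_complement(three_arm), core, reverse_complement(five_arm)]
    )


# Substrate at nt position 2721 of the table.
print(enzymatic_sequence("ACCCUGGUCC G AGGUG"))
# CACCU UGAUGGCAUGCACUAUGCGCG GGACCAGGGU
```

The printed value agrees with the entry listed for nt position 2721. A check of this kind may be useful for spotting transcription errors in individual rows, although it does not establish activity of any particular sequence.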
Table VI: Human telomerase reverse transcriptase (TERT) DNAzyme and Target 
Sequence 



nt Position 


DNAzyme Sequence 


Seq. ID Nos 


Substrate 


Seq. ID Nos 


9 


CAGGACGC GGCTAGCTACAACGA AGCGCTGC 


1832 


GCAGCGCU G GCGUCCUG 


4611 


11 


AGCAGGAC GGCTAGCTACAACGA GCAGCGCT 


1833 


AGCGCUGC G GUCCUGCU 


4612 


16 


TGCGCAGC GGCTAGCTACAACGA AGGACGCA 


1834 


UGCGUCCU G GCUGCGCA 


4613 


19 


ACGTGCGC GGCTAGCTACAACGA AGCAGGAC 


1835 


GUCCUGCU G GCGCACGU 


4614 


21 


CCACGTGC GGCTAGCTACAACGA GCAGCAGG 


1836 


CCUGCUGC G GCACGUGG 


4615 


23 


TCCCACGT GGCTAGCTACAACGA GCGCAGCA 


1837 


UGCUGCGC A ACGUGGGA 


4616 


25 


CTTCCCAC GGCTAGCTACAACGA GTGCGCAG 


1838 


CUGCGCAC G GUGGGAAG 


4617 


32 


GCCAGGGC GGCTAGCTACAACGA TTCCCACG 


1839 


CGUGGGAA G GCCCUGGC 


4618 


38 


GCCGGGGC GGCTAGCTACAACGA CAGGGCTT 


1840 


AAGCCCUG G GCCCCGGC 


4619 


44 


GGGGTGGC GGCTAGCTACAACGA CGGGGCCA 


1841 


UGGCCCCG G GCCACCCC 


4620 


47 


GCGGGGGT GGCTAGCTACAACGA GGCCGGGG 


1842 


CCCCGGCC A ACCCCCGC 


4621 


53 


GGCATCGC GGCTAGCTACAACGA GGGGGTGG 


1843 


CCACCCCC G GCGAUGCC 


4622 


56 


CGCGGCAT GGCTAGCTACAACGA CGCGGGGG 


1844 


CCCCCGCG A AUGCCGCG 


4623 


58 


CGCGCGGC GGCTAGCTACAACGA ATCGCGGG 


1845 


CCCGCGAU G GCCGCGCG 


4624 


61 


GAGCGCGC GGCTAGCTACAACGA GGCATCGC 


1846 


GCGAUGCC G GCGCGCUC 


4625 


63 


GGGAGCGC GGCTAGCTACAACGA GCGGCATC 


1847 


GAUGCCGC G GCGCUCCC 


4626 


65 


CGGGGAGC GGCTAGCTACAACGA GCGCGGCA 


1848 


UGCCGCGC G GCUCCCCG 


4627 


72 


TCGGCAGC GGCTAGCTACAACGA GGGGAGCG 


1849 


CGCUCCCC G GCUGCCGA 


4628 


75 


GGCTCGGC GGCTAGCTACAACGA AGCGGGGA 


1850 


UCCCCGCU G GCCGAGCC 


4629 


80 


CGCACGGC GGCTAGCTACAACGA TCGGCAGC 


1851 


GCUGCCGA G GCCGUGCG 


4630 


83 


GAGCGCAC GGCTAGCTACAACGA GGCTCGGC 


1852 


GCCGAGCC G GUGCGCUC 


4631 


85 


GGGAGCGC GGCTAGCTACAACGA ACGGCTCG 


1853 


CGAGCCGU G GCGCUCCC 


4632 


87 


CAGGGAGC GGCTAGCTACAACGA GCACGGCT 


1854 


AGCCGUGC G GCUCCCUG 


4633 


94 


TGCGCAGC GGCTAGCTACAACGA AGGGAGCG 


1855 


CGCUCCCU G GCUGCGCA 


4634 


97 


GGCTGCGC GGCTAGCTACAACGA AGCAGGGA 


1856 


UCCCUGCU G GCGCAGCC 


4635 


99 


GTGGCTGC GGCTAGCTACAACGA GCAGCAGG 


1857 


CCUGCUGC G GCAGCCAC 


4636 


102 


GTAGTGGC GGCTAGCTACAACGA TGCGCAGC 


1858 


GCUGCGCA G GCCACUAC 


4637 


105 


GCGGTAGT GGCTAGCTACAACGA GGCTGCGC 


1859 


GCGCAGCC A ACUACCGC 


4638 


108 


CTCGCGGT GGCTAGCTACAACGA AGTGGCTG 


1860 


CAGCCACU A ACCGCGAG 


4639 


111 


CACCTCGC GGCTAGCTACAACGA GGTAGTGG 


1861 


CCACUACC G GCGAGGUG 


4640 


116 


GGCAGCAC GGCTAGCTACAACGA CTCGCGGT 


1862 


ACCGCGAG G GUGCUGCC 


4641 


118 


GCGGCAGC GGCTAGCTACAACGA ACCTCGCG 


1863 


CGCGAGGU G GCUGCCGC 


4642 


121 


CCAGCGGC GGCTAGCTACAACGA AGCACCTC 


1864 


GAGGUGCU G GCCGCUGG 


4643 


124 


TGGCCAGC GGCTAGCTACAACGA GGCAGCAC 


1865 


GUGCUGCC G GCUGGCCA 


4644 


128 


AACGTGGC GGCTAGCTACAACGA CAGCGGCA 


1866 


UGCCGCUG G GCCACGUU 


4645 


131 


ACGAACGT GGCTAGCTACAACGA GGCCAGCG 


1867 


CGCUGGCC A ACGUUCGU 


4646 


133 


GCACGAAC GGCTAGCTACAACGA GTGGCCAG 


1868 


CUGGCCAC G GUUCGUGC 


4647 


137 


CGCCGCAC GGCTAGCTACAACGA GAACGTGG 


1869 


CCACGUUC G GUGCGGCG 


4648 


139 


GGCGCCGC GGCTAGCTACAACGA ACGAACGT 


1870 


ACGUUCGU G GCGGCGCC 


4649 


142 


CCAGGCGC GGCTAGCTACAACGA CGCACGAA 


1871 


UUCGUGCG G GCGCCUGG 


4650 


144 


CCCCAGGC GGCTAGCTACAACGA GCCGCACG 


1872 


CGUGCGGC G GCCUGGGG 


4651 
151 


CCTGGGGC GGCTAGCTACAACGA CCCAGGCG 


1873 


CGCCUGGG G GCCCCAGG 


4652 


159 


CCGCCAGC GGCTAGCTACAACGA CCTGGGGC 


1874 


GCCCCAGG G GCUGGCGG 


4653 


163 


CCAGCCGC GGCTAGCTACAACGA CAGCCCTG 


1875 


CAGGGCUG G GCGGCUGG 


4654 


166 


GCACCAGC GGCTAGCTACAACGA CGCCAGCC 


1876 


GGCUGGCG G GCUGGUGC 


4655 


170 


CGCTGCAC GGCTAGCTACAACGA CAGCCGCC 


1877 


GGCGGCUG G GUGCAGCG 


4656 


172 


CGCGCTGC GGCTAGCTACAACGA ACCAGCCG 


1878 


CGGCUGGU G GCAGCGCG 


4657 


175 


CCCCGCGC GGCTAGCTACAACGA TGCACCAG 


1879 


CUGGUGCA G GCGCGGGG 


4658 


177 


GTCCCCGC GGCTAGCTACAACGA GCTGCACC 


1880 


GGUGCAGC G GCGGGGAC 


4659 


183 


CGCCGGGT GGCTAGCTACAACGA CCCCGCGC 


1881 


GCGCGGGG A ACCCGGCG 


4660 


188 


AAAGCCGC GGCTAGCTACAACGA CGGGTCCC 


1882 


GGGACCCG G GCGGCUUU 


4661 


191 


CGGAAAGC GGCTAGCTACAACGA CGCCGGGT 


1883 


ACCCGGCG G GCUUUCCG 


4662 


198 


CAGCGCGC GGCTAGCTACAACGA GGAAAGCC 


1884 


GGCUUUCC G GCGCGCUG 


4663 


200 


ACCAGCGC GGCTAGCTACAACGA GCGGAAAG 


1885 


CUUUCCGC G GCGCUGGU 


4664 


202 


CCACCAGC GGCTAGCTACAACGA GCGCGGAA 


1886 


UUCCGCGC G GCUGGUGG 


4665 


206 


TGGGCCAC GGCTAGCTACAACGA CAGCGCGC 


1887 


GCGCGCUG G GUGGCCCA 


4666 


209 


CACTGGGC GGCTAGCTACAACGA CACCAGCG 


1888 


CGCUGGUG G GCCCAGUG 


4667 


214 


CCAGGCAC GGCTAGCTACAACGA TGGGCCAC 


1889 


GUGGCCCA G GUGCCUGG 


4668 


216 


CACCAGGC GGCTAGCTACAACGA ACTGGGCC 


1890 


GGCCCAGU G GCCUGGUG 


4669 


221 


ACGCACAC GGCTAGCTACAACGA CAGGCACT 


1891 


AGUGCCUG G GUGUGCGU 


4670 


223 


GCACGCAC GGCTAGCTACAACGA ACCAGGCA 


1892 


UGCCUGGU G GUGCGUGC 


4671 


225 


GGGCACGC GGCTAGCTACAACGA ACACCAGG 


1893 


CCUGGUGU G GCGUGCCC 


4672 


227 


CAGGGCAC GGCTAGCTACAACGA GCACACCA 


1894 


UGGUGUGC G GUGCCCUG 


4673 


229 


CCCAGGGC GGCTAGCTACAACGA ACGCACAC 


1895 


GUGUGCGU G GCCCUGGG 


4674 


237 


CCGTGCGT GGCTAGCTACAACGA CCCAGGGC 


1896 


GCCCUGGG A ACGCACGG 


4675 


239 


GGCCGTGC GGCTAGCTACAACGA GTCCCAGG 


1897 


CCUGGGAC G GCACGGCC 


4676 


241 


GCGGCCGT GGCTAGCTACAACGA GCGTCCCA 


1898 


UGGGACGC A ACGGCCGC 


4677 


244 


GGGGCGGC GGCTAGCTACAACGA CGTGCGTC 


1899 


GACGCACG G GCCGCCCC 


4678 


247 


CGGGGGGC GGCTAGCTACAACGA GGCCGTGC 


1900 


GCACGGCC G GCCCCCCG 


4679 


254 


GGGGCGGC GGCTAGCTACAACGA GGGGGGCG 


1901 


CGCCCCCC G GCCGCCCC 


4680 


257 


GAGGGGGC GGCTAGCTACAACGA GGCGGGGG 


1902 


CCCCCGCC G GCCCCCUC 


4681 


270 


CACCTGGC GGCTAGCTACAACGA GGAAGGAG 


1903 


CUCCUUCC G GCCAGGUG 


4682 


275 


CAGGACAC GGCTAGCTACAACGA CTGGCGGA 


1904 


UCCGCCAG G GUGUCCUG 


4683 


277 


GGCAGGAC GGCTAGCTACAACGA ACCTGGCG 


1905 


CGCCAGGU G GUCCUGCC 


4684 


282 


CTTCAGGC GGCTAGCTACAACGA AGGACACC 


1906 


GGUGUCCU G GCCUGAAG 


4685 


292 


CCACCAGC GGCTAGCTACAACGA TCCTTCAG 


1907 


CUGAAGGA G GCUGGUGG 


4686 


296 


CGGGCCAC GGCTAGCTACAACGA CAGCTCCT 


1908 


AGGAGCUG G GUGGCCCG 


4687 


299 


ACTCGGGC GGCTAGCTACAACGA CACCAGCT 


1909 


AGCUGGUG G GCCCGAGU 


4688 


305 


TGCAGCAC GGCTAGCTACAACGA TCGGGCCA 


1910 


UGGCCCGA G GUGCUGCA 


4689 


307 


TCTGCAGC GGCTAGCTACAACGA ACTCGGGC 


1911 


GCCCGAGU G GCUGCAGA 


4690 


310 


GCCTCTGC GGCTAGCTACAACGA AGCACTCG 


1912 


CGAGUGCU G GCAGAGGC 


4691 


316 


CGCACAGC GGCTAGCTACAACGA CTCTGCAG 


1913 


CUGCAGAG G GCUGUGCG 


4692 


319 




1914 


CAGAGGCU G GUGCGAGC 


4693 


321 


GCGCTCGC GGCTAGCTACAACGA ACAGCCTC 


1915 


GAGGCUGU G GCGAGCGC 


4694 


325 


CGCCGCGC GGCTAGCTACAACGA TCGCACAG 


1916 


CUGUGCGA G GCGCGGGG 


4695 


327 


CGCGCCGC GGCTAGCTACAACGA GCTCGCAC 


1917 


GUGCGAGC G GCGGCGCG 


4696 


330 


CTTCGCGC GGCTAGCTACAACGA CGCGCTCG 


1918 


CGAGCGCG G GCGCGAAG 


4697 


332 


TTCTTCGC GGCTAGCTACAACGA GCCGCGCT 


1919 


AGCGCGGC G GCGAAGAA 


4698 
339 


CAGCACGT GGCTAGCTACAACGA TCTTCGCG 


1920 


CGCGAAGA A ACGUGCUG 


4699 


341 


GCCAGCAC GGCTAGCTACAACGA GTTCTTCG 


1921 


CGAAGAAC G GUGCUGGC 


4700 


343 


AGGCCAGC GGCTAGCTACAACGA ACGTTCTT 


1922 


AAGAACGU G GCUGGCCU 


4701 


347 


CCGAAGGC GGCTAGCTACAACGA CAGCACGT 


1923 


ACGUGCUG G GCCUUCGG 


4702 


354 


CGCGAAGC GGCTAGCTACAACGA CGAAGGCC 


1924 


GGCCUUCG G GCUUCGCG 


4703 


359 


AGCAGCGC GGCTAGCTACAACGA GAAGCCGA 


1925 


UCGGCUUC G GCGCUGCU 


4704 


361 


CCAGCAGC GGCTAGCTACAACGA GCGAAGCC 


1926 


GGCUUCGC G GCUGCUGG 


4705 


364 


CGTCCAGC GGCTAGCTACAACGA AGCGCGAA 


1927 


UUCGCGCU G GCUGGACG 


4706 


369 


GGCCCCGT GGCTAGCTACAACGA CCAGCAGC 


1928 


GCUGCUGG A ACGGGGCC 


4707 


374 


CCGCGGGC GGCTAGCTACAACGA CCCGTCCA 


1929 


UGGACGGG G GCCCGCGG 


4708 


378 


GCCCCCGC GGCTAGCTACAACGA GGGCCCCG 


1930 


CGGGGCCC G GCGGGGGC 


4709 


384 


GGGGGGGC GGCTAGCTACAACGA CCCCGCGG 


1931 


CCGCGGGG G GCCCCCCC 


4710 


395 


GTGAAGGC GGCTAGCTACAACGA CTCGGGGG 


1932 


CCCCCGAG G GCCUUCAC 


4711 


401 


CTGGTGGT GGCTAGCTACAACGA GAAGGCCT 


1933 


AGGCCUUC A ACCACCAG 


4712 


404 


ACGCTGGT GGCTAGCTACAACGA GGTGAAGG 


1934 


CCUUCACC A ACCAGCGU 


4713 


408 


GCGCACGC GGCTAGCTACAACGA TGGTGGTG 


1935 


CACCACCA G GCGUGCGC 


4714 


410 


CTGCGCAC GGCTAGCTACAACGA GCTGGTGG 


1936 


CCACCAGC G GUGCGCAG 


4715 


412 


AGCTGCGC GGCTAGCTACAACGA ACGCTGGT 


1937 


ACCAGCGU G GCGCAGCU 


4716 


414 


GTAGCTGC GGCTAGCTACAACGA GCACGCTG 


1938 


CAGCGUGC G GCAGCUAC 


4717 


417 


CAGGTAGC GGCTAGCTACAACGA TGCGCACG 


1939 


CGUGCGCA G GCUACCUG 


4718 


420 


GGGCAGGT GGCTAGCTACAACGA AGCTGCGC 


1940 


GCGCAGCU A ACCUGCCC 


4719 


424 


TGTTGGGC GGCTAGCTACAACGA AGGTAGCT 


1941 


AGCUACCU G GCCCAACA 


4720 


429 


CACCGTGT GGCTAGCTACAACGA TGGGCAGG 


1942 


CCUGCCCA A ACACGGUG 


4721 


431 


GTCACCGT GGCTAGCTACAACGA GTTGGGCA 


1943 


UGCCCAAC A ACGGUGAC 


4722 


434 


TCGGTCAC GGCTAGCTACAACGA CGTGTTGG 


1944 


CCAACACG G GUGACCGA 


4723 


437 


GCGTCGGT GGCTAGCTACAACGA CACCGTGT 


1945 


ACACGGUG A ACCGACGC 


4724 


441 


CAGTGCGT GGCTAGCTACAACGA CGGTCACC 


1946 


GGUGACCG A ACGCACUG 


4725 


443 


CGCAGTGC GGCTAGCTACAACGA GTCGGTCA 


1947 


UGACCGAC G GCACUGCG 


4726 


445 


CCCGCAGT GGCTAGCTACAACGA GCGTCGGT 


1948 


ACCGACGC A ACUGCGGG 


4727 


448 


TCCCCCGC GGCTAGCTACAACGA AGTGCGTC 


1949 


GACGCACU G GCGGGGGA 


4728 


456 


CGCCCCGC GGCTAGCTACAACGA TCCCCCGC 


1950 


GCGGGGGA G GCGGGGCG 


4729 


461 


CCCCACGC GGCTAGCTACAACGA CCCGCTCC 


1951 


GGAGCGGG G GCGUGGGG 


4730 


463 


GCCCCCAC GGCTAGCTACAACGA GCCCCGCT 


1952 


AGCGGGGC G GUGGGGGC 


4731 


469 


GCAGCAGC GGCTAGCTACAACGA CCCCACGC 


1953 


GCGUGGGG G GCUGCUGG 


4732 


472 


GCAGCAGC GGCTAGCTACAACGA AGCCCCCA 


1954 


UGGGGGCU G GCUGCUGG 


4733 


475 


GGCGCAGC GGCTAGCTACAACGA AGCAGCCC 


1955 


GGGCUGCU G GCUGCGCC 


4734 


478 


CGCGGCGC GGCTAGCTACAACGA AGCAGCAG 


1956 


CUGCUGCU G GCGCCGCG 


4735 


480 


CACGCGGC GGCTAGCTACAACGA GCAGCAGC 


1957 


GCUGCUGC G GCCGCGUG 


4736 


483 


GCCCACGC GGCTAGCTACAACGA GGCGCAGC 


1958 


GCUGCGCC G GCGUGGGG 


4737 


485 


TCGCCCAC GGCTAGCTACAACGA GCGGCGCA 


1959 


UGCGCCGC G GUGGGCGA 


4738 


489 


GTCGTCGC GGCTAGCTACAACGA CCACGCGG 


1960 


CCGCGUGG G GCGACGAC 


4739 


492 




1961 


CGUGGGCG A ACGACGUG 


4740 


495 


CAGCACGT GGCTAGCTACAACGA CGTCGCCC 


1962 


GGGCGACG A ACGUGCUG 


4741 


497 


ACCAGCAC GGCTAGCTACAACGA GTCGTCGC 


1963 


GCGACGAC G GUGCUGGU 


4742 


499 


GAACCAGC GGCTAGCTACAACGA ACGTCGTC 


1964 


GACGACGU G GCUGGUUC 


4743 


503 


AGGTGAAC GGCTAGCTACAACGA CAGCACGT 


1965 


ACGUGCUG G GUUCACCU 


4744 


507 


CAGCAGGT GGCTAGCTACAACGA GAACCAGC 


1966 


GCUGGUUC A ACCUGCUG 


4745 
511 


GTGCCAGC GGCTAGCTACAACGA AGGTGAAC 


1967 


GUUCACCU G GCUGGCAC 


4746 


515 


CAGCGTGC GGCTAGCTACAACGA CAGCAGGT 


1968 


ACCUGCUG G GCACGCUG 


4747 


517 


CGCAGCGT GGCTAGCTACAACGA GCCAGCAG 


1969 


CUGCUGGC A ACGCUGCG 


4748 


519 


CGCGCAGC GGCTAGCTACAACGA GTGCCAGC 


1970 


GCUGGCAC G GCUGCGCG 


4749 


522 


GAGCGCGC GGCTAGCTACAACGA AGCGTGCC 


1971 


GGCACGCU G GCGCGCUC 


4750 


524 


AAGAGCGC GGCTAGCTACAACGA GCAGCGTG 


1972 


CACGCUGC G GCGCUCUU 


4751 


526 


CAAAGAGC GGCTAGCTACAACGA GCGCAGCG 


1973 


CGCUGCGC G GCUCUUUG 


4752 


533 


ACCAGCAC GGCTAGCTACAACGA AAAGAGCG 


1974 


CGCUCUUU G GUGCUGGU 


4753 


535 


CCACCAGC GGCTAGCTACAACGA ACAAAGAG 


1975 


CUCUUUGU G GCUGGUGG 


4754 


539 


GGAGCCAC GGCTAGCTACAACGA CAGCACAA 


1976 


UUGUGCUG G GUGGCUCC 


4755 


542 


CTGGGAGC GGCTAGCTACAACGA CACCAGCA 


1977 


UGCUGGUG G GCUCCCAG 


4756 


549 


GGCGCAGC GGCTAGCTACAACGA TGGGAGCC 


1978 


GGCUCCCA G GCUGCGCC 


4757 


552 


GTAGGCGC GGCTAGCTACAACGA AGCTGGGA 


1979 


UCCCAGCU G GCGCCUAC 


4758 


554 


TGGTAGGC GGCTAGCTACAACGA GCAGCTGG 


1980 


CCAGCUGC G GCCUACCA 


4759 


558 


CACCTGGT GGCTAGCTACAACGA AGGCGCAG 


1981 


CUGCGCCU A ACCAGGUG 


4760 


563 


CCGCACAC GGCTAGCTACAACGA CTGGTAGG 


1982 


CCUACCAG G GUGUGCGG 


4761 


565 


GCCCGCAC GGCTAGCTACAACGA ACCTGGTA 


1983 


UACCAGGU G GUGCGGGC 


4762 


567 


CGGCCCGC GGCTAGCTACAACGA ACACCTGG 


1984 


CCAGGUGU G GCGGGCCG 


4763 


571 


GCGGCGGC GGCTAGCTACAACGA CCGCACAC 


1985 


GUGUGCGG G GCCGCCGC 


4764 


574 


ACAGCGGC GGCTAGCTACAACGA GGCCCGCA 


1986 


UGCGGGCC G GCCGCUGU 


4765 


577 


GGTACAGC GGCTAGCTACAACGA GGCGGCCC 


1987 


GGGCCGCC G GCUGUACC 


4766 


580 


GCTGGTAC GGCTAGCTACAACGA AGCGGCGG 


1988 


CCGCCGCU G GUACCAGC 


4767 


582 


GAGCTGGT GGCTAGCTACAACGA ACAGCGGC 


1989 


GCCGCUGU A ACCAGCUC 


4768 


586 


CGCCGAGC GGCTAGCTACAACGA TGGTACAG 


1990 


CUGUACCA G GCUCGGCG 


4769 


591 


GGCAGCGC GGCTAGCTACAACGA CGAGCTGG 


1991 


CCAGCUCG G GCGCUGCC 


4770 


593 


GTGGCAGC GGCTAGCTACAACGA GCCGAGCT 


1992 


AGCUCGGC G GCUGCCAC 


4771 


596 


TGAGTGGC GGCTAGCTACAACGA AGCGCCGA 


1993 


UCGGCGCU G GCCACUCA 


4772 


599 


GCCTGAGT GGCTAGCTACAACGA GGCAGCGC 


1994 


GCGCUGCC A ACUCAGGC 


4773 


605 


GGCCGGGC GGCTAGCTACAACGA CTGAGTGG 


1995 


CCACUCAG G GCCCGGCC 


4774 


610 


GCGGGGGC GGCTAGCTACAACGA CGGGCCTG 


1996 


CAGGCCCG G GCCCCCGC 


4775 


616 


CGTGTGGC GGCTAGCTACAACGA GGGGGCCG 


1997 


CGGCCCCC G GCCACACG 


4776 


619 


TAGCGTGT GGCTAGCTACAACGA GGCGGGGG 


1998 


CCCCCGCC A ACACGCUA 


4777 


621 


ACTAGCGT GGCTAGCTACAACGA GTGGCGGG 


1999 


CCCGCCAC A ACGCUAGU 


4778 


623 


CCACTAGC GGCTAGCTACAACGA GTGTGGCG 


2000 


CGCCACAC G GCUAGUGG 


4779 


627 


GGGTCCAC GGCTAGCTACAACGA TAGCGTGT 


2001 


ACACGCUA G GUGGACCC 


4780 


631 


TTCGGGGT GGCTAGCTACAACGA CCACTAGC 


2002 


GCUAGUGG A ACCCCGAA 


4781 


640 


CCAGACGC GGCTAGCTACAACGA CTTCGGGG 


2003 


CCCCGAAG G GCGUCUGG 


4782 


642 


TCCCAGAC GGCTAGCTACAACGA GCCTTCGG 


2004 


CCGAAGGC G GUCUGGGA 


4783 


649 


GTTCGCAT GGCTAGCTACAACGA CCCAGACG 


2005 


CGUCUGGG A AUGCGAAC 


4784 


651 


CCGTTCGC GGCTAGCTACAACGA ATCCCAGA 


2006 


UCUGGGAU G GCGAACGG 


4785 


655 


AGGCCCGT GGCTAGCTACAACGA TCGCATCC 


2007 


GGAUGCGA A ACGGGCCU 


4786 


659 




2008 


GCGAACGG G GCCUGGAA 


4787 


666 


GCTATGGT GGCTAGCTACAACGA TCCAGGCC 


2009 


GGCCUGGA A ACCAUAGC 


4788 


669 


GACGCTAT GGCTAGCTACAACGA GGTTCCAG 


2010 


CUGGAACC A AUAGCGUC 


4789 


672 


CCTGACGC GGCTAGCTACAACGA TATGGTTC 


2011 


GAACCAUA G GCGUCAGG 


4790 


674 


TCCCTGAC GGCTAGCTACAACGA GCTATGGT 


2012 


ACCAUAGC G GUCAGGGA 


4791 


683 


ACCCCGGC GGCTAGCTACAACGA CTCCCTGA 


2013 


UCAGGGAG G GCCGGGGU 


4792 
D O 27 


AovjOLjLjAC (nJoL. i AVjL. J. AL.AAL.VjA L, V- v-taVjL. L. i 


2014 


AGGGGGGG G GUGGGGGU 


4793 


Q Q 


L LjOLALjOL ovjL i ALjL. i ALAALoA LLAooLjVjVj 


2015 


^^^^^^^^^^^^^^^^ o o ot t/^ o o a 
GGGGGUGG G GGGUGGGA 


4794 


/U J 


tjtjVjL 1 LrtjL. LjLjL i ALjL. 1 AL.AAL.LjA ALiLxLLL-AG 


2016 


GUGGGCCU G GCCAGGCC 


4795 


/ U / 


L.LL. WjLrtjV- LKjL. i ALiL. i AL.AAv-CaA i (j<jLA(jC3L 


2017 


GGGUGGGA G GGCCCGGG 


4796 


"71 A 


L.L i L-vjUAL. ULjL 1 AL>L. i AL.AAL.CiA LLvjVjLjCjLT 


2018 


AGCCCCGG G GUGCGAGG 


4797 


/ X o 


LiLLiLLrL LioL i ALtL 1 ALAALGA ACLLGUGb 


2019 


CCCCGGGU G GCGAGGAG 


4798 


/ «6 ft 


r*r*r*r*r*cr^r^ /^/^OT'Ar^O'T'AOA a r^r^ a /-irp/-i/-irp/-(/-io 
V-LuLGLGL, GGLiAGL.iAL.AALGA LILLILGL 


2020 


GCGAGGAG G GCGCGGGG 


4799 




oL.L.L.V^L.LjL. kjV7UlAVjL.lAL.AAL.vjA VjL.dCV-lL 


T n o 1 


GAGGAGGG G GGGGGGGC 


4800 




VjIjCAL. 1 OV-. VjvjL. 1 AVjL. 1AL.AAL.VjA L.k^L.L.vjUVjL. 




GCGCGGGG G GCAGUGCC 


4801 


•7 c: 


vjLILtVjL-AL GGL i AGL. i ALAALGA iGL.L.L.L.LG 


T n o T 


GGGGGGGA G GUGGCAGC 


4802 




L.VjoV.^ 1 vjVjV». LtVjL. i AGL. i AL.AALL»A AL. i VjV,-L.V-,L 


2024 


GGGGGAGU G GGGAGGGG 


4803 




APT"PP/'2PP ^TiPPrpfiPPTapa appa tpppjvptp 
AL. I 1 V-vjVjV«. VjVjL^ 1 AVjL. 1 AL.AAL.VjA 1 VjVjv^AL. 1 Vj 


o n o c 


GAGUGGGA G GCGGAAGU 


4804 




r*n(^f^'h.r*'hr^ pppTappTaPA appa ttippppt"/^ 
V».L>VjL AVjAU VjLiV_ 1 ALrL. 1 ALJiAL.L» A 1 1 LoVjL. 1 Vj 


O o ^ 


CAGCCGAA G GUCUGCCG 


4805 


7 c; 1 


r^r^'hti^r^r^r^r^ pr^r^TA^^/^TiA/^A a/^/^ a a^^a/^'t't'/^o 
GLAALGGL GGL i AGL, i ALAALGA AGAL i i LG 


2027 


CGAAGUCU G GCCGUUGC 


4806 


7 c; 4 


T'PPPPAAP Pr^r^T" A^^/^T A /^A A A r^r^r^AOA /^T" 

iGGGCAAL oGLlAvjClALAALoA bGLAGALl 


2028 


AGUCUGCC G GUUGCCCA 


4807 


7t^ 7 


TPTTPTSPP PPPTapPTapa ivppa AAO/^rr^Ar' 
i V^ 1 1 VjVjLjL. VjVjL 1 AvjL. 1 ALJiAUoA AAL.VjGUAG 


2029 


OT TOOO/^TTTT /^/~»/^/^TV A TV 

GUGGGGUU G GCCCAAGA 


4808 


7^; C 
/DO 


VjL.L.lVjkjLiL oVjLlA\jL,lALAAL.LyA LI LI IVjVjIj 


2030 


CCCAAGAG G GCCCAGGC 


4809 


/ / Z 


r^r*c*r* a pp p pr'/^'PA/^/^TA^A a^^/^ a /-irn/-t/^(^/-i/~irp 
LVjLLALGL oGL. i AGU i AL AALGA LiGGGLLi 


2031 


AGGCCCAG G GCGUGGCG 


4810 


774 
/ / ft 


A (~*r* r^r^ a pr^r^T^A/^/^TA/^A a^oa i^o/^'p/^o/^r' 
AVjL,GV_L,AU GGu i AGG lAUAALGA GGGIGGGL 


2032 


GCCCAGGC G GUGGCGCU 


4811 


inn 
III 


ppp A/^r^r^r* pppTA/^r^TA r»A A A A r'O rt/^T/^ 
GGLAGGGL GGG 1 AGG 1 ALAAGGA GAGGLG 1 G 


2033 


CAGGCGUG G GCGCUGCC 


4812 


77Q 
1 1 y 


fWinPAnp ^'VSPT'appTiPa appTv r*r*r*'h^nr*r*r* 
VjVjVjVjV^AVjL. GGG 1 AGG lAGAALGA GGGALGGG 




GGCGUGGC G GCUGCCCC 


4813 


7 n 7 
/ o ^ 


T^r^t^r^r^r^r^f^ p/^/^t'a/^^t'a/^a aooa a^/^/^o*^ao 
i GAGGGGG GGG i AGG 1 ALAAGGA AGGGGGAG 


2035 


GUGGCGCU G GCCCCUGA 


4814 


7 Q ri 


GGIGGGGG GGG i AGGTACAACGA TCAGGGGC 


2036 


GCCCCUGA G GCCGGAGC 


4815 


7QC 


GGG i GGGG GGG 1 AGGTACAACGA TCCGGCTC 


2037 


GAGCCGGA G GCGGACGC 


4816 


Ron 

o U U 


AGGGGGGl GGG 1 AGGTACAACGA CCGCTCCG 


2038 


GGGAGGGG A AGGGGGGU 


4817 


Q n 7 
o U z 


LAALGGGG GGG rAGGTACAACGA GTCCGCTC 


2039 


GAGCGGAG G GGGCGUUG 


4818 


O U D 


TT* (~*r^r^ A A /^r*r"7'Ar^/^T'A/^A A/^/^ A r*rT*r*r*TT^r*r^ 
IGGGGAAG GGGxAGG 1 AGAAGGA GGGCGTCC 


2040 


GGACGCCC G GUUGGGCA 


4819 


O X J. 


'^r^cr*r*'vr*r* ppp'pao/^'paoa a/^<^a /^/^ a a r^r^/^r^ 
AGGLG 1 GG GGG 1 AGG 1 ALAAGGA CCAACGGG 


2041 


CCCGUUGG G GCAGGGGU 


4820 


OX/ 


L. L G AGGAG GGG 1 AGG 1 AGAAGGA GCCTGCCC 


2042 


GGGCAGGG G GUGCUGGG 


4821 


824 


nPPTPPPP PPPTappTiaPA APP A r*(^T^r*nT\r>r^ 
VjoVj i vjVjVjL. vjVjL. 1 ALtG i AGAAGGA L G AGGAG G 


2043 


GGUCCUGG G GCCCACCC 


4822 


ft 9 ft 
o ^ O 


GGGG GGG 1 GG G 1 AGG 1 AG AAG G A GGGG G GAG 


2044 


CUGGGCCC A AGGGGGGC 


4823 


ft'^il 
O J *4 


or^'POP'T'PP /^/^ OT" A or^T* A A A A t^t^r^r^r^T^rtr* 
GG 1 GG 1 GG GGG 1 AGG 1 AGAAGGA GGGGGTGG 


2045 


CCAGCGGG G GGAGGAGG 


4824 


fl 7 Q 


GGAGGGGl GGG 1 AGGTACAACGA CCTGCCCG 


2046 


GGGGGAGG A AGGCGUGG 


4825 


D /I 1 

oft 1 


GiGGAGGG GGGTAGCTACAACGA GTCCTGCC 


2047 


GGCAGGAC G GCGUGGAC 


4826 


J 


GGG i GGAG GGGTAGCTACAACGA GCGTCCTG 


2048 


GAGGACGC G GUGGACCG 


4827 


PA 7 
Oft / 


GAG 1 GGG 1 GGGTAGCTACAACGA CCACGCGT 


2049 


AGGCGUGG A ACCGAGUG 


4828 


ft c: 7 
O D Z 


AL.GG i GAL GGL 1 AGGTACAACGA TCGGTCCA 


2050 


UGGAGGGA G GUGAGGGU 


4829 


ft c: C 
ODD 


ALGALGGi GGGTAGCTACAACGA CACTCGGT 


2051 


ACCGAGUG A AGGGUGGU 


4830 


ft c: ft 
ODO 


AAA f^f^ A r^r^f^rn TV /^rp tv r> 7\ TV r^f^ A r*/^rt\f^ tv /^mA^ 

GAAAGGAG GGG 1 AGGTACAACGA GGTCACTC 


2052 


GAGUGACC G GUGGUUUC 


4831 


ft CI 
OO J. 


A A ^ AAA ^ /■'/^/^fTi A ^/^rn A ^ A A /^/^ TV /"itv ^^^m/^Tv 

AGAGAAAC GGGTAGCTACAACGA CACGGTCA 


2053 


UGAGGGUG G GUUUGUGU 


4832 


ftC7 


/^Ar^OA/^A*^ /~'/~'OT>AOOrnA/^AAy^/~1A TV/~«TVTVTl/^/^TV 

GAGLAGAL GGG 1 AGGTACAACGA AGAAACCA 


2054 


UGGUUUGU G GUGUGGUG 


4833 


869 


GACACCAC GGCTAGCTACAACGA ACAGAAAC


2055


GUUUCUGU G GUGGUGUC


4834


872 


GGTGACAC GGCTAGCTACAACGA CACACAGA


2056 


UGUGUGUG G GUGUCAGC 


4835 


874 


CAGGTGAC GGGTAGCTACAACGA ACGAGACA 


2057 


UGUGUGGU G GUCAGGUG 


4836 


877 


TGGGAGGT GGGTAGCTACAACGA GACACCAC 


2058 


GUGGUGUG A AGGUGGGA 


4837 


881 


GGTCTGGC GGGTAGCTACAACGA AGGTGACA 


2059 


UGUCACCU G GCCAGACC 


4838 


886 


GGGGGGGT GGGTAGCTACAACGA CTGGGAGG 


2060 


GGUGCGAG A AGCCGCGG 


4839 
890




PPPTAP/^TiA^A AOOA 

GGG 1 AGL 1 AGAAGGA 


GGGTCTGG 


2061 


OO A O A fyf^f^ 

CCAGACCC 


G 


GCCGAAGA 


4840 


R Q Q 




/^/-t/-»rp7\/^/-trp7v /-i 7\ A A 

GGG i AGL X AGAACGA 


TTCTTCGG 


2062 


/^^/~IA A /~l A A 

CCGAAGAA 


G 


GCCACCUC 


4841 


Q n o 




r^f> /^rp TV /^rp A A A 7\ 

GGG 1 AGL 1 ALAALGA 


GGCTTCTT 


2063


TV TV TV TV /~1 

AAGAAGCC 


A 


ACCUCUUU 


4842






GGL 1 AGL i ALAALGA 


OOrp/^/~»A A A 

LLTLLAAA 


2064 


T TT TT T/~1 TV 

UUUGGAGG 


G 


GUGCGCUC 


4843 


Q1 

yx / 




/— 1 *— 1 ^rp TV <^ /~trp TV O A A A 

GGLTAGCTACAACGA 


ACCCTCCA 


2065 


UGGAGGGU 


G 


GCGCUCUC 


4844 


Q 1 Q 

y ly 


LAbAGAGL 


(~if~\ /"trp A O Orp A O A A O/^ A 

GGLTAGCTACAACGA 


GCACCCTC 


2066 


GAGGGUGC 


G 


GCUCUCUG 


4845 


yz / 


KjL\jL\3 1 CjC 


GGLTAGLTACAACGA 


/^A/^A/*^A/^y^ 

C AG AG AG C 


2067 


GCUCUCUG 


G 


GCACGCGC 


4846 


Q O O 


rp/^/^ /-I /-I rp 


/"•/"•/^rp^z-iz-trpTV T\ 7V r^r^ TV 

GGLTAGCTACAACGA 


/~»/^/^ A A TV 

GCCAGAGA 


2068 


UCUCUGGC 


A 


ACGCGCCA 


4847 


y J ± 


TV /-«rp/"ii'-i r^f-*r> 


GGL TAGCTACAACGA 


GTGCCAGA 


2069 


UCUGGCAC 


G 


GCGCCACU 


4848 


Q "5 "3 


LjLjAVj i CjVjL 


/^O/^rp TV /^^rp TV TV TV 7\ 

GGLTAGLTACAACGA 


r^r^ TTxr^ tv 

GCGTGCCA 


2070 


UGGCACGC 


G 


GCCACUCC 


4849 


Q '3 C 

y J b 


Lj 1 LjVjQjACjX 


O O /^rp TV orn TV TV TV /"i A 

GGCTAGL 1 ALAACGA 


GGCGCGTG 


2071 


CACGCGCC 


A 


ACUCCCAC 


4850 


Q A *? 
y*tz 


oVjAiurLroi 


GGL i. AGL TAL AALGA 


OOO A Orp/^r^ 

GGGAGTGG 


2072 


CCACUCCC 


A 


ACCCAUCC 


4851 




ppapr?paT 


GGL 1 AGL 1 ALAALGA 


GGGTGGGA 


2073 


UCCCACCC 


A 


AUCCGUGG 


4852 


Q n 
y D u 


L.LjLiUtwL.AL 


(^C^(~\rn Ti /-irp A O A AOOA 

GGL i AGL 1 ALAALGA 


e~^/~i TV rp/l/^/^rp 

GGATGGGT 


2074 


TV Tl T T/^ 

ACCCAUCC 


G 


GUGGGCCG 


4853 


Q A 

y 




GGL i AGL 1 ALAACGA 


CCACGGAT 


2075 


AUCCGUGG 


G 


GCCGCCAG 


4854 


Q C "7 

yo / 




OOOT* A OOT" A OA A OO A 

GGL 1 AGLTACAACGA 


GGCCCACG 


2076 


CGUGGGCC 


G 


GCCAGCAC 


4855 


y D ± 




OOOTIAOOTIA OA A OO A 

GGL 1 AGLTACAACGA 


rn/^/-t OOOOO 

TGGCGGCC 


2077 


GGCCGCCA 


G 


GCACCACG 


4856 


y D J 


<^ rp r-i rp 


GGLTAGCTACAACGA 


GCTGGCGG 


2078 


CCGCCAGC 


A 


ACCACGCG 


4857 


y o o 




OOOT" AOOT* A OA A OO A 

GGLTAGCTACAACGA 


GGTGCTGG 


2079 


CCAGCACC 


A 


ACGCGGGC 


4858 


y o o 




PPPT" AOOT A O A AOOA 

GGL i AGL i ALAALGA 


/^rp/^/-irpo/~irp 

GTGGTGCT 


2080 


AGCACCAC 


G 


GCGGGCCC 


4859 






PPPT" APOfP A O A A OO A 

GGL i AGL 1 ALAALGA 


OOOOOrn/^/^ 

CCGCGTGG 


2081 


CCACGCGG 


G 


GCCCCCCA 


4860 


Q 7 Q 


A i 1 VjoA i 


y^/^/^rpTVOOT" A OA A OO A 

GGL i AGL 1 ALAALGA 


GGGGGGCC 


2082 


GGCCCCCC 


A 


AUCCACAU 


4861 




PPPPST'l^'P 
L-oL-vjAi tji 


OOOT* AOOT" A O A A OO A 

GGL i AGL i ALAALGA 


GGATGGGG 


2083 


CCCCAUCC 


A 


ACAUCGCG 


4862 


y o 3 


oUL.VjL.\jAi 


/^/^ /*irp TV /^orp A O A A f^r^ tv 

GGLTAGC 1 ALAALGA 


GTGGATGG 


2084 


CCAUCCAC 


A 


AUCGCGGC 


4863 


Q fl Q 

y o o 




OOOT" A OOrp A OA A OO A 

GGLTAGCTACAACGA 


TV m/^m^^ TV 

GATGTGGA 


2085 


UCCACAUC 


G 


GCGGCCAC 


4864 


y y 1 


GTGGTGGC 


GGCTAGCTACAACGA 


CGCGATGT 


2086 


ACAUCGCG 


G 


GCCACCAC 


4865 


y y^ 


GACGTGGT 


GGCTAGCTACAACGA 


GGCCGCGA 


2087 


UCGCGGCC 


A 


ACCACGUC 


4866 


yy / 


AGGGACGT


GGCTAGCTACAACGA


GGTGGCCG 


2088 


CGGCCACC 


A 


ACGUCCCU 


4867 


Q Q Q 

y y y 


CCAGGGAC


GGCTAGCTACAACGA


GTGGTGGC 


2089 


GCCACCAC 


G 


GUCCCUGG 


4868 


1 n n Q 
X u U o 


AGGCGTGT


GGCTAGCTACAACGA


CCCAGGGA 


2090 


UCCCUGGG 


A 


ACACGCCU 


4869 


xu xu 


PA AP/^/^/^T" 


OO Orp A OOrp TV TV TV r^/^ A 

GGL i AGLTACAACGA 


GTCCCAGG 


2091 


CCUGGGAC 


A 


ACGCCUUG 


4870 


X U X z 


PAPA APO/^ 

GAUAAvjrtjL. 


OOOT AOOT* A OA A OO A 

GGLTAGCTACAACGA 


GTGTCCCA 


2092 


UGGGACAC 


G 


GCCUUGUC 


4871 


X U J. / 


r^/^r'PPP Ar^ 


OOOT" A OOT* A OA A OO A 

GGL TAGCTACAACGA 


TV TV /^ /*T /^^^ m/^ 

AAGGCGTG 


2093 


CACGCCUU 


G 


GUCCCCCG 


4872 


1 r> 9 c; 
X u z o 


GCGl AUAU 


OOOT" A OOT^ A OA A OO A 

GGL i AGLTACAACGA 


/^ /^ /^ /^ ^ TV 

CGGGGGAC 


2094 


GUCCCCCG 


G 


GUGUACGC 


4873 


1 n9 7 
X uz / 


V-GGv-Gi AL- 


OOOTi A OOT^ A OA A OO A 

GGL i AGLTACAACGA 


ACCGGGGG 


2095 


CCCCCGGU 


G 


GUACGCCG 


4874 


xuz y 


CTCGGCGT


GGCTAGCTACAACGA


ACACCGGG 


2096 


CCCGGUGU 


A 


ACGCCGAG 


4875 


1 n "J 1 
X U J 1 


GTCTCGGC


GGCTAGCTACAACGA 


GTACACCG 


2097 


CGGUGUAC 


G 


GCCGAGAC 


4876 


X U J / 


TGCTTGGT


GGCTAGCTACAACGA


CTCGGCGT 


2098 


ACGCCGAG 


A 


ACCAAGCA 


4877 


XU^Z 


GGAAGTGC 


GGCTAGCTACAACGA 


TTGGTCTC 


2099 


GAGACCAA 


G 


GCACUUCC 


4878 


1 n A yi 


GAGGAAGT 


GGCTAGCTACAACGA 


GCTTGGTC 


2100 


GACCAAGC 


A 


ACUUCCUC 


4879 


XU3 J 


TGAGGAGT


GGCTAGCTACAACGA 


AGAGGAAG 


2101 


CUUCCUCU 


A 


ACUCCUCA 


4880 


1062 


CTTGTCGC 


GGCTAGCTACAACGA 




7 1 n 7 
z ± \J z 


PTTPPTTOnO 


o 
G 


O OO A O A A O 

GLGALAAG 


4881 


1065 


CTCCTTGT 


GGCTAGCTACAACGA 


CGCCTGAG 


2103 


CUCAGGCG 


A 


ACAAGGAG 


4882 


1072 


GCAGCTGC 


GGCTAGCTACAACGA 


TCCTTGTC 


2104 


GACAAGGA 


G 


GCAGCUGC 


4883 


1075 


GCCGCAGC 


GGCTAGCTACAACGA 


TGCTCCTT 


2105 


AAGGAGCA 


G 


GCUGCGGC 


4884 


1078 


AGGGCCGC 


GGCTAGCTACAACGA 


AGCTGCTC 


2106 


GAGCAGCU 


G 


GCGGCCCU 


4885 


1081 


AGGAGGGC 


GGCTAGCTACAACGA 


CGCAGCTG 


2107 


CAGCUGCG 


G 


GCCCUCCU 


4886 
1 nQ7




zlUo 


UGGUUGGU A AGUGAGGU 


>i 0 0 ^ 






z loy 


GGUAlUlA g gcugugug 


4888 






zllU 


T TOT TOT TO A O OOOOAOOO 

UGULUGAG G GGLGAGGL 


4889


X X X J 




zl 11 


OAOOOOOA 0 OOOTTOAOTT 

GAGGLGLA G GGLUGAGU 


4890 


111ft 

± X X O 




OTTO 

zllz 


CCAGCCUG A ACUGGCGC 


4891 


X X Z Z 


r*cr^i^r^CT*r' rTr^TArr^fA/^A ao/^a /^a/^hp/^a/^/^ 
<^UVjAVjL-*jC i A^jL. i AL.AACvjA CAkjiLAOOj 


OTTO 
^ Hi 


CCUGACUG G GCGCUCGG 


4892 


1 1 OA 
X X Z 4 


U 1 L^CVaAo^ ijoC 1 AoL. i AL»AA^-vjA kjL.L.Atj i L.A 


0 1 T /I 

z 114 


TTOAOTTOOO O OOTTOOOAO 

UGALUGGl g gcucggag 


4893 


X X J z 


L«L.A\.oAv3L. UoL. 1 AuL. 1 AL.AAL.urA L. 1 LL.0A0L. 


zllb 


GCUCGGAG G GCUCGUGG 


4894


1 1 1 c 
1 ± J D 


1 L. i L.L.AL LjLjL. i ACjL. i AL,AALCjA GAGCCTCC 


2116 


GGAGGCUC G GUGGAGAC 


4895 


1 Z 


AALrAHaVji bCjL i ALiL i ALAALvjA CTLCACGA 


2117 


UCGUGGAG A ACCAUCUU 


4896 


X Xft D 


AVjAAALjA 1 LjLjL i AVjL. i AL, AALLjA (iLr 1 L I CCA 


2118 


T T/^ TV TV /^/^ TV TV T T/^T TT TT T/^T T 

UGGAGACC A AUCUUUCU 


4897 


1 1 t: c 
J. ±33 


C L i LjoAAL Lj(jC i AvjC 1 ALAACCjrA CCAGAAAG 


2119 


CUUUCUGG G GUUCCAGG 


4898 


1 1 C O 


i L L. AGGGC GGC i AGC J. ACAACG A CTGGAACC 


2120 


^*|/^T TT T^^^l TV A*l f^f^f^'XT T/^ TV 

GGUUCCAG G GCCCUGGA 


4899 


X X O j7 


L-V-iLr^jLAi tjVjL i AoL. i Av^AAL-GA Cv^AGGGUC 


0 1 0 T 

z Iz 1 


GGCCCUGG A AUG CC AGG 


4900 


1171 

X X / X 


1 L.L.V- ioVjv„ vavjL.iAV3L.iAL-AAL.GA AiCL-AGGG 


zlzz 


OOOTTOOATT O OOOAOOOA 

GGCUGGAU G GCCAGGGA 


4901


1178 


L. VjVjVjVj/4Vj i LjVjL, i AL7L, i AL, AAGGA G L. L. 1 GGG A 


9 1 0 0 

z Iz J 


UGGGAGGG A AGUGGGCG 


4902 


1185


V... AAv^V_ i VjL. VjVjL. i AoL. 1 AG AAGGA GGGGAG i G 


zlz4 


GACUCCCC G GCAGGUUG 


4903 


1 1 R Q 

X X O 17 


LjLiLjLiVwAAL LjLiU 1 AGG i AL AAGGA G i GGGGGG 


2125 


^ /^/^ TV ^^T TT T^ /^/^ /^/^ 

CCCCGCAG G GUUGCCCC 


4904 


1 1 Q9 

X X 7Z 


CLnc^a/'ijnn.r* pppTapp'paPA appa a A/^/^T/^r^/^ 
LjVjV^VjVjvjVjL, LjLjV- i AGG i AG AAGGA AALG 1 GGG 


2126 


GGGAGGUU G GCCCCGCC 


4905 


1 1 Q7 

X X 7 / 


n.dfz.C'^rLf^r* PPPT'app'paPA appa r*r*r*r*r^T<Tir* 
oVjVjLAooU GGG i AGG i AG AAGGA GGGGGAAL 


2127 


GUUGCCCC G GCCUGCCC 


4906 


X Z U X 


nr^'vcfif^f^c* ppp'pappT' A PA A^/^ A a^o/^o/^o^ 
^jL 1 vjVjVjijC GGG i AGG i AGAAGGA AGGGGGGG 


2128 


CCCCGCCU G GCCCCAGC 


4907 


1 907 
X z u / 


APTAPPPP P/^^T" Ar'/^T' A 0 A A /^/^ A T'O/^/^/^/^ A 

ALi i ALjLGG GGL i AGG i AL AAGGA i GGGGLAG 


2129 


CUGCCCCA G GCGCUACU 


4908 


1 on Q 

X Z U J7 


L L ALj i AvjG GGG i AGG i AGAAGGA GG 1 GGGGG 


2130 


GCCCCAGC G GCUACUGG 


4909 


1212 


f'mPPa/TlT' PPPTAPPfaPA ar^/^a Aoorr^mo 
1 i LiV^ v^ Avj i GGG i AGG i AGAAGGA AGGGG i GG 


2131 


TV /^T T TV TV /^TT/^^^/^TV TV 

CCAGCGCU A ACUGGCAA 


4910 


1216 


npaTTTPP pnpT'a<^PTapa appa r*'h.r*'v^r'r'r* 
\j v_ A i i i VjL. VjVjG i AGG i AGAAGGA GAG 1 AGGG 


0100 
2 13z 


00/^TAOTTO 0 /^^AAATT^^ 

CGCUACUG G GCAAAUGC 


4911 


19 9 0 
X z z u 


r^riPPriPaT" PPPT'app'paPA appa 'T"T'/^/^/^Ar"T' 
LjVjL,HaLAi LjLjL- i AGG 1 AL AAGGA iiGGGAGi 


2133 


/IT T/^ ^ '1\ 7\ T\ TV T T/^ /^y^ ^ 

ACUGGCAA A AUGCGGCC 


4912 


1 999 
X z z z 


ni^^nPPPP PPPT* APPT* A P A A PP A A 'P'P'PO/^O A 

VjVjLjVjL.L.L»L. Vjljui AGG i AGAAGGA AiiiGLGA 


2134 


UGGCAAAU G GCGGCCCC 


4913 


1 99(^ 

X Z Z D 


apa^r2^^p PPPT'app'paPA APPA r*r*r*T<fT**T^T^r* 
ALAVjVjvjoL. ouG 1 AGG i AGAAGGA GGLAl 1 i G 


2135 


CAAAUGCG G GCCCCUGU 


4914 


X Z J X 


PPAPAAA/^ /^PPT" A^OT" A 0 A A A 

L L AG AAAG GGG x AGG 1 AGAAGGA AGGGGCCG 


2136 


CGGCCCCU G GUUUCUGG 


4915 


X Z U 


PAAP/^A/^/^ ^A^r^T^A/^/^TiA ^A A A rn^OA^^AAA 

GAAGGAGG GGG i AGG T AC AAGGA TCCAGAAA 


2137 


UUUCUGGA G GCUGCUUG 


4916 


1 9 A "3 


IGGGAAGG GGG i AGG TAC AAGGA AGCTCCAG 


2138 


CUGGAGCU G GCUUGGGA 


4917 


1 9C^1 
J.Z DX 


GGGG 1 GG i GGG TAGCT AGAAGGA TCCCAAGC 


2139 


GCUUGGGA A ACCACGCG 


4918 


1 9 c: 4 
X Z D 'I 


LiGLGLGi GGG i AGGTACAACGA GGTTCCGA 


2140 


UGGGAACC A ACGCGCAG 


4919 


1 

X Z 3 O 


C*'ti.C^'T^C^C^C*C^ PPPT" A/^^^T'A PA A 0^ A 0'TT^O'T''P^^ 

^AL- 1 LiL.L3t^ GGG i AGG iAL AAGGA GlGGilCC 


2141 


^ TV TV ^ /^TV y^rf^ TV ^T T^ 

GGAACCAC G GCGCAGUG 


4920 


X Z 3 O 


PPPAPT'PP PPPT* AP/^T* A A A A /~1 /-i/^rri/^/^rprp 

VjVjL.AL. i LjVjG i AGG i AGAAGGA GGG 1 GG i i 


2142 


AACCACGC G GCAGUGCC 


4921 


1261 


a<^/Tl/^r2PaP PPP'PapPT'a pa A pp A *x^r*r*r*r^r**T*f^ 
AVjLjLtvjV^ AL. vjVjL. i A^jG i AGAAGGA i GGGGG i G 


2143 


CACGCGCA G GUGCCCCU 


4922 


XZO J 


PT"APPPPP /""P (^T A 0 /^T* A A A A A /^T"*^ 

GTAGGGGC GGCTAGCTACAACGA ACTGCGCG


2144 


CGCGCAGU G GCCCCUAC 


4923 


1 9^; Q 
X Z D 


CACCCCGT GGCTAGCTACAACGA AGGGGCAC


2145 


GUGCCCCU A ACGGGGUG 


4924 


19 74 
XZ / ft 


AGGAGCAC GGCTAGCTACAACGA CCCGTAGG


2146 


CCUACGGG G GUGCUCCU 


4925 


1 97C 
X Z / O 


TGAGGAGC GGCTAGCTACAACGA ACCCCGTA


2147 


UACGGGGU G GCUCCUCA 


4926 


1 9 Q£ 
i.Z OO 


CAGTGCGT GGCTAGCTACAACGA CTTGAGGA


2148 


UCCUCAAG A ACGCACUG 


4927 


1288 


GGCAGTGC GGCTAGCTACAACGA GTCTTGAG


2149


CUCAAGAC G GCACUGCC


4928


1290 


CGGGCAGT GGCTAGCTACAACGA GCGTCTTG 


2150 


CAAGACGC A ACUGCCCG 


4929 


1293 


CAGCGGGC GGCTAGCTACAACGA AGTGCGTC 


2151 


GACGCACU G GCCCGCUG 


4930 


1297 


CTCGCAGC GGCTAGCTACAACGA GGGCAGTG 


2152 


CACUGCCC G GCUGCGAG 


4931 


1300 


CAGCTCGC GGCTAGCTACAACGA AGCGGGCA 


2153 


UGCCCGCU G GCGAGCUG 


4932 


1304 


ACCGCAGC GGCTAGCTACAACGA TCGCAGCG 


2154 


CGCUGCGA G GCUGCGGU 


4933 
1307




9 T C C 


TTPPPAPPTT P PPPPTTPAP 

UGGGAGGU G GGGGUGAG 


>l Q 9 /I 


1 ^ 1 n 

X J ± u 






GAGGUGGG G GULAGGLL 


/I Q 1 C 

4y Jb 


J.J±J 




oner? 
zlb / 


GuGGGGUG A ACCCCAGC 


4936 


J. J J. 7 


r*c*r*r*r^^c*f^ /^opt'ap/^'t^a /^a a a t'o/^^/^'p/^a 
L. i tsL ooL. i. ALjL. 1 ALJ^AlvjA xCjCjCjCj i (j A 


2158 


UCACCCCA G GCAGCCGG 


4937 




A p A r^r^r^r^c* /np/^'PAPPnr'APA appa T'/^^'Pi^/^or* 
AUAuUvjoL. LHjL.iAvjV-iAv.J\ALt3A TGCTGGGG 


2159 


CCCCAGCA G GCCGGUGU 


4938 


1 T 9 <C 
X O ^ O 


APAPAPAP r^pr^T' A ^^T" A ^ A A A r^f^r^r^TT^r^T^ 

AUAtjACAL LjUL i ACjC i ACAALCjA LLtCjL i LjL i 


2160 


AGGAGGCG G GUGUCUGU 


4939 


1 7 9 ft 
X J ^ o 


PPAP&PAP PPPT* APPT A P A A PP A A P/^O/^OT'O 

LjU AL AVjAL- 00 L. i AGV- i AC AAL. GA AL. L. GG L i G 


2161 


GAGGGGGU G GUCUGUGC 


4940 


1 7'59 
J. J J 4!: 


LV^LnjoLAL. GGL. i AGG 1 AGAAGGA AGAUAGGG 


2162 


CGGUGUCU G GUGCCCGG 


4941 


X J J t 


Tnc^nnnnn nfiPTZ^^^PTZiPaappa ^r''Ar*'Ar*'Ar* 

1 L> LovjoL. vjvjL 1 AoL. 1 AL^AAGoA AG AGAG AG 


9 T ^ T 

z lb J 


GUGUGUGU G GCCCGGGA 


4942 


1 "5 d. R 
X J t o 


cr*'vr*(~*r^r^r^ ppptapptapa appa t^fptt^/^/^/^ 
L.LivjvjUL3U GGL i AGG i AGAALGA liGiGGGG 


2164 


CGGGAGAA G GCCCCAGG 


4943 


X J D J 


P2iP2inanp ppptapptapa app a r*f^^r*r*r*r^r* 
L. AL-AvjAoL. VjoL i AoL. i AGAAGGA G G 1 GGGGG 


z lob 


GGGGGAGG G GCUCUGUG 


4944 


X J D O 


n.r^r*f^r^r*'^f^ ppptapptiapr appa APArr'/^r'T' 
0LL.0V..UAG GGG 1 AGG 1 AGAAGGA AGAGGGG I 




AGGGCUCU G GUGGCGGC 


4945


X J D ± 


GGGGG GGG GGGl AGGTAGAACGA CACAGAGC 


2167 


GCUCUGUG G GCGGCCCC 


4946 


X J D ft 


Tcr^cr^rr^r* ppptappt'a/^a a/^/^a o<^o/^a/^a/^ 
i HjGoGGL GGG 1 AGG i AGAAGGA GGG GA GAG 


2168 


CUGUGGCG G GCCCCCGA 


4947 


X J o u 


nTPTPTPPT* PPPTAPPTAPA A PP A PPTT'OO^Or^ 

VjlL-lLjlLji GGG i Ala V- i AGAAGGA GGlGGiGG 


2169


7\ /*^/^ TV TV TV /~l TV TV TV /~l 

GGAGGAGG A ACACAGAC 


4948 


X J o z 


nnn'vr^'pr*'P pppt'app'papa appa ptp/^tt'O't' 
Vjvtvj 1 L. i. 0 1 LjuGI AGG 1 AGAAGGA GiGGlGGi 


2170 


TV TV TV TV TV TV TV 

AGGAGGAC A ACAGACCC 


4949 


■ 1 T ft ^; 

X J o o 


A^-^oIjVj'j 1 vjvjL. i AGG 1 AGAAGGA G 1 G 1 G 1 GG 


2171 


GGACACAG A ACCCCCGU 


4950 


X J J7 ^ 


PAnPPf^BP PPPHPAPPTAPA APPA PPP/^r«'T>r"T' 

L-A^Lji-oAL vjov_ lAoL. i AGAAGGA GGGGGiGi 


2172 


AGACCCCC G GUCGCCUG 


4951 


X J j7 D 


C'h.rr^'b.nrir^ pppt'apptapa appa r»Ts.r*r*r*r*r*r^ 
L. AL. L ALjvjL VjVjL, i Ao G i AG AAG GA G AGGGGGG 


2173 


CCCCCGUC G GCCUGGUG 


4952 


1400 


AvjL. 1 V7LAL 1 AVjL i ALJu\GGA GAGGGGAG 


2174 


GUGGGGUG G GUGCAGCU 


4953 


X ft VJ z 


rSPa/^PTPP PPPTAPPTAPA APP A APPAr«/^r«/^ 

GGAGGiGG GGG i AGG 1, AGAAGGA AGGAGGCG 


2175 


/*10/nOTTOOTT 0 OOTVOOTT/IO 

CGCCUGGU G GCAGCUGC 


4954 


T 4 n c; 

X*± VJ D 


vjLjALjGAvjG LjLjL. i AGL i AGAAGGA 1 GGALGAG 


2176 


CUGGUGCA G GCUGCUCC 


4955 


1 ADR 
X*4 U O 


r^c*r^r^t^^r*r^ pppt'apot'a^a a/^/^a a^^tio/^7\<^ 
GGGGGAGG GGG i AGG i AGAAGGA AGCTGCAC 


2177 


GUGCAGCU G GCUCCGCC 


4956 


1 4.1 
X ft X J 


njvnc*'vnjr*r* ppptrpptapa appa /t!/^ao/^a/^/^ 
Vj 1 oL 1 vjVjL, LsoG 1 AGG 1 AGAAGGA GGAGGAGG 


2178 


GCUGCUCC G GCCAGCAC 


4957 


1417
X ^ X / 


TnPT'PT'PP PPPT APPT AP A A pp A T^/^/^^^/^O AO 

i vjL, ILj i LjL. oLjG 1 AGG i AGAAGGA i GGGGGAG 


2179 


CUCCGCCA G GCACAGCA 


4958 


1 4 1 Q 
X *i X -? 


rSPT'^P'PPT' PPPT APPT" A P A A /^/^ A r^r^^r^f^r^rr^ 
LrLUjLUji vjvjG 1 AGG i AGAAGGA GGi GGGGG 


2180 


CCGCCAGC A ACAGCAGC 


4959 


1422 


rtnci/^r^TCLr* ppptapptapa ap/^a fr^r^rnf^f^mr*/-* 
oVjLjVjL. 1 oL. Vj\jL i AGG i AGAAGGA i G i GG 1 GG 


2181 


CCAGCACA G GCAGCCCC 


4960 


X *t ^ 3 


c*r*^rijnnnf^ ppp'pappt'apa appa T*r*r*fT*r*rT>f^t^ 
L L ALtVjLivjL. GGG 1 AGG i AGAAGGA 1 GG 1 G 1 GG 


2182 


GCACAGCA G GCCCCUGG 


4961 


1432


APAPPTnp f^r2PTZi<^PT'aPAappa r^'h.r^nnr*^^ 
L i vjL. ooL. i. ALjL 1 AL~AAGGA GAGGGGG 1 


2183 


AGCCCCUG G GCAGGUGU 


4962 


1 4 <^ 

X*i O O 


PPPT'APAP PPPT" A /^/^T' A ^ A A /^/^ A t^T^r^r^r^Ty r*ry 

GGG i AG AG GGG J. AGG x AGAAGGA GTGCCAGG 


2184 


CCUGGCAG G GUGUACGG 


4963 


1 4 ft 


'^nc^r^r^^'h.r^ ppp'pappt'apa a a Ttr^r*fT>r*r*r*T^ 
AVjLLVjiAL- GGG i AGG i AGAAGGA AGGiGGGA 


2185 


UGGCAGGU G GUACGGCU 


4964 


1440
X •» ft \J 


A A/TJOPPT' PPPT A PPT A P A A 00 A AOAOO'n*^/^ 

LjAAvjL LLj i GGG i AGG i AGAAGGA AG AGG 1 GG 


2186 


GCAGGUGU A ACGGCUUC 


4965 


1 4 4 

X fl *i ^ 


PAPPAAPP PPPTAPPTAPA A/^OA O/^TtA/^AO/^ 

L-ALUAAGG GGG 1 AGG i AGAAGGA GGTACAGG 


2187 


GGUGUACG G GCUUCGUG 


4966 


1 4 4 P 


r^c^r^r^c^r^T^r^ ppp'pappt'apa a a /^a A/^^/~</^rn 
GLGGGGAG GGG i AGG i AGAAGGA GAAGCGGT 


2188 


ACGGCUUC G GUGCGGGC 


4967 


1 4 RO 
X ft 3 U 


T^cu^r^r^r^r^r^ pppTAppfPAOA a/^oa a/^^tv 7vo/^<^ 
AGGGGGGG GGG i AGG i AGAAGGA ACGAAGCC 


2189 


GGCUUCGU G GCGGGCCU 


4968 


1 4 e;4 
Xft 3 ft 


AGGCAGGC GGCTAGCTACAACGA CCGCACGA


2190 


UCGUGCGG G GCCUGCCU 


4969 


Xfi 3 O 


GGGGAGGG GGL 1 AGG i ALAACGA AGGCCCGC 


2191 


GCGGGCCU G GCCUGCGC 


4970 


1 4 fi9 
X fx O Z 


r^(~^r^(~*r^r^r^ pf^/~"T'A/^oT'A<^A a^otv 7\^o/^t\^/^/^ 
GGGGGGGG GGL 1 AGG i AGAALGA AGGCAGGC 


2192 


GCCUGCCU G GCGCCGGC 


4971 


1 4^^4 
X 1 O ft 


GAGGGGGG GGG i. AGG 1 AGAAGGA GCAGGCAG 


2193 


CUGCCUGC G GCCGGCUG 


4972 


1 4£fl 


GCACCAGC GGCTAGCTACAACGA CGGCGCAG


2194 


CUGCGCCG G GCUGGUGC 


4973 


1 4 79 
Xft / Z 


GGGGGCAC GGCTAGCTACAACGA CAGCCGGC


2195 


GCCGGCUG G GUGCCCCC 


4974 


1474 


CTGGGGGC GGCTAGCTACAACGA ACCAGCCG


2196


CGGCUGGU G GCCCCCAG


4975


1482 


CCAGAGGC GGCTAGCTACAACGA CTGGGGGC 


2197 


GCCCCCAG G GCCUCUGG 


4976 


1491 


CCTGGAGC GGCTAGCTACAACGA CCCAGAGG 


2198 


CCUCUGGG G GCUCCAGG 


4977 


1498 


CGTTGTGC GGCTAGCTACAACGA CTGGAGCC 


2199 


GGCUCCAG G GCACAACG 


4978 


1500 


TTCGTTGT GGCTAGCTACAACGA GCCTGGAG 


2200 


CUCCAGGC A ACAACGAA 


4979 


1503 


GCGTTCGT GGCTAGCTACAACGA TGTGCCTG 


2201 


CAGGCACA A ACGAACGC 


4980 
1507




O O o 


/"'AOAAOOA A A/^O^/^/^OTT 

CACAACGA A ACGCCGCU 


4981 






o o n o 
^ z U J 


/^AA/^r*AA/^ O /^/~i/~l/~l /"IT TT 

LAALGAAL G GLLGGuUL 


/I Q O O 

4y 




LrALrLxAAoV^ vjvjL. 1 ALjL. i AuAAUoA LtLjUj i i Lo 


2204 


LGAALGLC G GCUUCCUC 


4983




L.ilL7V3il7l viLrL.lALTL.lACAAL.CjA ILLlLiALrLj 


2205 


O A^T T/^ A/~«/^A A A*^A/*^^AA/~» 

LLULAGGA A ACACCAAG 


4984




1 1 U 1 1 LjLi 1 ULjL. 1 AtjL. 1 AL.AACC3A GTTCCTGA 


2206 


UCAGGAAC A ACCAAGAA 


4985 


J. 0 J 4 


A^ATT^AA/^ 0^/~"T'7\/^^'T*7\/^A AO/^7\ rprpr^i 1 11 n^/-^rp 

AUA 1 LtAAL LxLjL 1 ACjL 1 ACAAL-LjA TTCTTGGT 


2207


A/^/^AA/^A7\ /^T TT T<^ 7\ T T/^T T 

ACCAAGAA G GUUCAUCU 


4986 




AoooAkjA 1 oGL. 1 AGu 1 Av^AAL.L» A GAAL 1 1 L 1 


O O A Q 


A A A /"IT TT T/"' A A T T/-1T T/^/~l/~»T T 

AGAAGuUC A AUCUCCCU 


4987 




lul3L.AloU IjVjL. 1 AvjL 1 ALJiAL,oA 11L.LL.L.AG 


2209 


/-IT T/~» /^ /^ /~1 A A /"I /"I /^ A T T/^ /~i A 

CUGGGGAA G GCAUGCCA 


4988 


J. 3 Dfi 


L.1 ILjljv-Al LHjL. 1 ALjL. 1 AL.AAL,(jA GL110L.LL. 


2210 


GGGGAAGC A AUGCCAAG 


4989 


X J D O 


AVaUiivjoL LruL. i AGL, i ALAAL.GA AIGLIILL 


2211 


r^/^ TV TV TV T T TV Tl /^T T 

GGAAGCAU G GCCAAGCU 


4990


X J. 


Vj^^VjAVjrALjL, VjvjL 1 AOL. 1 AL.AAL.LjA 1 1 GLiLA 1 G 


OTTO 


LAUGLGAA G GCUCUCGC 


4991 


J. D O / 


L. L. 1 LjL. AoL (jLtL. 1 AVjV- 1 ALAAV-urA oAoAGL. 1 1 


13 


AAGCUCUC G GCUGCAGG 


4992 


J. 3 / U 


nr*^r*r*'vr*r* /"s/^OTAr^r'TAr'a apoa AO/^/^Ar'A/^ 
oL. 1 UL. 1 oL VjVjL. 1 AoL. 1 AuAAL-vjA AVjCVjAVjAo 


2214 


/-IT T/^T T/^/"l /^T T /-I i^/^ A /^/^ A /^/^ 

CUCUCGCU G GCAGGAGC 


4993 


X D / O 


ALLj 1 LAvjL LjGL 1 AGL 1 AL-AACoA i. LL i GLAG 


2215 


/^T T/-1 A /"I /"< A /-» /IT T/-1 TV /^/~1T T 

CUG CAGGA G GCUGACGU 


4994 


±DO\J 


1 ILUALLrl LjGu 1 ALjL. 1 ALAAv^GA LAGLiLUl 




A Z"*/^ A /-» /-IT TO A A /~1/*1T T/"«/~» A A 

AGGAGCUG A ACGUGGAA 


4995


X 3 O ^ 


1 L X 1 L.LAV- IjoL. 1 AoL i ALAAL-VjA Vj 1 L.AGL 1 L. 


2217 


Z^ A O /^T TO A O /"IT TO A TV O TV 

GAGCUGAC G GUGGAAGA 


4996 


X307 


ALL»v- 1 LA 1 ooL. 1 ALiL 1 ACAAUviA L.TTCCACG 


2218 


OOT TOO A A O A AT TO A O OOT T 

CGUGGAAG A AUGAGCGU 


4997 


X D 17 J 


C*C^r^r^7<C*r^r^ *^/~i>^mAOr^'T'A^A A^OA 'T'/^AT"/^'T"T"/^ 

ULVjLALLjL LjLjL i AGL i ALAAUGA ILAiLliG 


2219 


TV TV TV T TV /~t /^/^T T/^ /^/^/^ 

GAAGAUGA G GCGUGCGG 


4998 




^r^r^r^c^r^'h.r^ p'p/^'t'a/^o't'a/^a a^/^a r^(^n^r^T\'T>r*'T 
ILLGGLAG GGG 1 AGL 1 AGAAGGA GLlLAICr 


2220 


TV ^ TV T T/^ TV /^T T/^ /^/^ T\ 

AG AUG AGC G GUGCGGGA 


4999 


X 3 J7 / 


AVj 1 v^L.L.VjL. ooL. 1 AvjL 1 AGAAGGA ALGL 1 GAl 




ATTOAOOOTT O OOOOOTVOTT 

AUGAGCGU G GCGGGACU 


5000 


X O U <S 


AVjL.VjL.Ao 1 ovjL. 1 AVjL. 1 AL.AAGGA GLGGGALG 


2222 


/.^/-IT TO 00/-t^ A A OTTOOOOTT 

CGUGCGGG A ACUGCGCU 


5001 


1 n c: 

X O U D 


r^r^t^T^r^r^c^r* /^/^/^t'a/^/^'t'a/^a a/^/^a T^r*'Tr*r*r^r*r^ 
LLAALjLGL GGL i AGL i ALAAGGA AG i LLLGG 


2223 


GCGGGACU G GCGCUUGG 


5002 


X o u / 


AGLLAAGL GGG 1 AGG 1 AGAAGGA GGAGTCCC 


2224 


GGGACUGC G GCUUGGCU 


5003 


1^19 
X o x^ 


1 GGGG AGG GGG 1 AGG 1 AGAAGGA CAAGLGGA 


2225 


UGCGCUUG G GCUGCGCA 


5004 


X O X D 


^r^f^'pr*r*r*r^ nr^ ot'ao^t'a/^a a a a/^ooaa/^/^ 
IGGIGLGG GGGl AGG 1 AGAAGGA AGCCAAGC 


2226 


GCUUGGCU G GCGCAGGA 


5005 


1 f^T 7 
X O X / 


^jLiLGiGG GGL J. AGG i AGAAGGA GLAGGLAA 


2227 


UUGGCUGC G GCAGGAGC 


5006 


X U 4^ 


L.L.L ILjLtVjV^ LtLjL. 1 AGG 1 AGAAGGA IGGIGLGG 


2228 


GCGCAGGA G GCCCAGGG 


5007 


XO J X 


L. ALru L AAG GGG 1 AG G 1 AGAAGGA G L G 1 GGGG 


2229 


GCCCAGGG G GUUGGCUG 


5008 


1635


azir'APZ^nr* /Ti(Tir"parT"pnoAA/^PA oaa/^/^o^t 
/\rt.L./^i^A\jL ooL. lAoL-i AGAAGGA GAALGGGl 


2230 


AOOOOTTTTO O O OT TOTTOT TT T 

AGGGGUUG G GCUGUGUU 


5009 


1638


^.^LjVjAALAL ooL iAoL 1 ALAAGGA AGGLAALG 


2231 


OOTTTTOOOTT O OTTOTTTTOOO 

GGUUGGCU G GUGUUCCG 


5010 


X O *1 V 


f^f^f^nnAAf^ nnr^TfA/TLr^PA/^A A/^/^ A A/^A/^/^/^AA 
OV^ L ooAAL. oVjL. 1 AoG 1 AG AAG G A AG AG G G AA 


2232 


UUGGCUGU G GUUCCGGC 


5011 


X D ^ D 


'vr*''vr*r^/^r*c* /i/^/^'pao/^t'aoa a/^*^a /^/^/^aa^a/^ 
I G 1 GGGGG GGG 1 AGG 1 AGAAGGA CGGAACAC 


2233 


GUGUUCCG G GCCGCAGA 


5012 


X D J7 


TT^ r^TT^T^r^ O/^/^iPAO^'PA^^A A^^A /^/^^/^^^atv 

1 Lj L. i L i GL GGG 1 AGG 1 AGAAGGA GGCCGGAA 


2234 


UUCCGGCC G GCAGAGCA 


5013 


1 <^ R 4. 
X O 3 fi 


GALGG i GG GGG i AGG xAGAACGA TCTGCGGC 


2235 


GCCGCAGA G GCACCGUC 


5014 


XO 3 O 


LAGAGGG 1 GGG 1 AGG xAGAACGA GCTCTGCG 


2236 


CGCAGAGC A ACCGUCUG 


5015 


1 Q 
XO 3 27 


A/^O/^A/^AO /^^Otp A /^/^fp A A A TR rri/"i / nii/^fn 

AGGGAGAG GGL 1 AGG xAGAACGA GGTGCTC T 


2237 


AGAGCACC G GUCUGCGU 


5016 


X D D O 


(^(^T'l^Ti r^r~*r^ /^/^/^'pa/^/^'Paoa a/^/^a a/^a/^/^/~ht»/^ 
LGiL-AGGG GGG i AGGTAGAACGA AGACGGTG 


2238 


CACCGUCU G GCGUGAGG 


5017 


T C c: 
X D O Z> 


L i GL 1 GAG GGG 1 AGCT AGAAGGA GCAGACGG 


2239 


CCGUCUGC G GUGAGGAG 


5018 


X O / 3 


GCCAGGAT GGCTAGCTACAACGA CTCCTCAC


2240 


GUGAGGAG A AUCCUGGC 


5019 


1 C 7 Q 
XO / 7 


AACTTGGC GGCTAGCTACAACGA CAGGATCT


2241 


AGAUCCUG G GCCAAGUU 


5020 


1 

XOOfc 


GCAGGAAC GGCTAGCTACAACGA TTGGCCAG


2242 


CUGGCCAA G GUUCCUGC 


5021 


1690 


GCCAGTGC GGCTAGCTACAACGA AGGAACTT 


2243 




AAGUUCCU G GCACUGGC


5022


1692 


CAGCCAGT GGCTAGCTACAACGA GCAGGAAC 


2244 


GUUCCUGC A ACUGGCUG 


5023 


1696 


TCATCAGC GGCTAGCTACAACGA CAGTGCAG 


2245 


CUGCACUG G GCUGAUGA 


5024 


1700 


ACACTCAT GGCTAGCTACAACGA CAGCCAGT 


2246 


ACUGGCUG A AUGAGUGU 


5025 


1704 


GTACACAC GGCTAGCTACAACGA TCATCAGC 


2247 


GCUGAUGA G GUGUGUAC 


5026 


1706 


ACGTACAC GGCTAGCTACAACGA ACTCATCA 


2248 


UGAUGAGU G GUGUACGU 


5027 
1708


CGACGTAC GGCTAGCTACAACGA ACACTCAT


2249 


AUGAGUGU G GUACGUCG 


5028 


1710 


GACGACGT GGCTAGCTACAACGA ACACACTC 


2250 


GAGUGUGU A ACGUCGUC 


5029 


1712


TCGACGAC GGCTAGCTACAACGA GTACACAC


2251 


GUGUGUAC G GUCGUCGA 


5030


1715 


AGCTCGAC GGCTAGCTACAACGA GACGTACA 


2252 


UGUACGUC G GUCGAGCU 


5031 


1720 


TGAGCAGC GGCTAGCTACAACGA TCGACGAC 


2253 


GUCGUCGA G GCUGCUCA 


5032 


1723 


ACCTGAGC GGCTAGCTACAACGA AGCTCGAC 


2254 


GUCGAGCU G GCUCAGGU 


5033 


1729


AGAAAGAC GGCTAGCTACAACGA CTGAGCAG 


2255 


CUGCUCAG G GUCUUUCU 


5034 


1740 


CGTGACAT GGCTAGCTACAACGA AAAAGAAA 


2256 


UUUCUUUU A AUGUCACG 


5035 


1742 


TCCGTGAC GGCTAGCTACAACGA ATAAAAGA 


2257 


UCUUUUAU G GUCACGGA 


5036 


1745


GTCTCCGT GGCTAGCTACAACGA GACATAAA 


2258 


UUUAUGUC A ACGGAGAC 


5037 


1751 


AACGTGGT GGCTAGCTACAACGA CTCCGTGA 


2259 


UCACGGAG A ACCACGUU 


5038 


1754 


TGAAACGT GGCTAGCTACAACGA GGTCTCCG 


2260 


CGGAGACC A ACGUUUCA 


5039 


1756 


TTTGAAAC GGCTAGCTACAACGA GTGGTCTC 


2261 


GAGACCAC G GUUUCAAA 


5040 


1767 


GAGCCTGT GGCTAGCTACAACGA TCTTTTGA 


2262 


UCAAAAGA A ACAGGCUC 


5041 


1771


AAAAGAGC GGCTAGCTACAACGA CTGTTCTT 


2263 


AAGAACAG G GCUCUUUU 


5042 


1 T Q O 


CTTCCGGT GGCTAGCTACAACGA AGAAAAAG


2264 


CUUUUUCU A ACCGGAAG 


5043 


1 /yi 


CCAGACAC GGCTAGCTACAACGA TCTTCCGG 


2265 


CCGGAAGA G GUGUCUGG 


5044 


T T Q 

1 / y O 


CTCCAGAC GGCTAGCTACAACGA ACTCTTCC 


2266 


GGAAGAGU G GUCUGGAG 


5045 


1 O 

loUU 


CAACTTGC GGCTAGCTACAACGA TCCAGACA 


2267 


UGUCUGGA G GCAAGUUG 


5046 


1804 


TTTGCAAC GGCTAGCTACAACGA TTGCTCCA 


2268 


UGGAGCAA G GUUGCAAA 


5047 


1807 


TGCTTTGC GGCTAGCTACAACGA AACTTGCT 


2269 


AGCAAGUU G GCAAAGCA 


5048 


1812 


TCCAATGC GGCTAGCTACAACGA TTTGCAAC 


2270 


GUUGCAAA G GCAUUGGA 


5049 


1 Q 1 y1 

lol4 


ATTCCAAT GGCTAGCTACAACGA GCTTTGCA 


2271 


UGCAAAGC A AUUGGAAU 


5050 


T Q O A 


TGTCTGAT GGCTAGCTACAACGA TCCAATGC 


2272 


GCAUUGGA A AUCAGACA 


5051 


1 o o c 


AGTGCTGT GGCTAGCTACAACGA CTGATTCC 


2273 


GGAAUCAG A ACAGCACU 


5052 


T Q O O 


TCAAGTGC GGCTAGCTACAACGA TGTCTGAT


2274 


AUCAGACA G GCACUUGA 


5053 


1830 


CTTCAAGT GGCTAGCTACAACGA GCTGTCTG 


2275 


CAGACAGC A ACUUGAAG 


5054 


1841 


AGCTGCAC GGCTAGCTACAACGA CCTCTTCA 


2276 


UGAAGAGG G GUGCAGCU 


5055 


1843 


GCAGCTGC GGCTAGCTACAACGA ACCCTCTT 


2277 


AAGAGGGU G GCAGCUGC 


5056 


1846 


CCCGCAGC GGCTAGCTACAACGA TGCACCCT 


2278 


AGGGUGCA G GCUGCGGG 


5057 


1849 


GCTCCCGC GGCTAGCTACAACGA AGCTGCAC 


2279 


GUGCAGCU G GCGGGAGC 


5058 


1 o c c 

lobb 


CCGACAGC GGCTAGCTACAACGA TCCCGCAG 


2280 


CUGCGGGA G GCUGUCGG 


5059 


1858


CTTCCGAC GGCTAGCTACAACGA AGCTCCCG 


2281 


CGGGAGCU G GUCGGAAG 


5060 


1 o b D 


ACCTCTGC GGCTAGCTACAACGA TTCCGACA


2282 


UGUCGGAA G GCAGAGGU 


5061 


1 O / 1 


TGCCTGAC GGCTAGCTACAACGA CTCTGCTT 


2283 


AAGCAGAG G GUCAGGCA 


5062 


1 o /b 


GATGCTGC GGCTAGCTACAACGA CTGACCTC 


2284 


GAGGUCAG G GCAGCAUC 


5063 


1 D '7 Q 


CCCGATGC GGCTAGCTACAACGA TGCCTGAC 


2285 


GUCAGGCA G GCAUCGGG 


5064 


1881 


TTCCCGAT GGCTAGCTACAACGA GCTGCCTG 


2286 


CAGGCAGC A AUCGGGAA 


5065 


1889 


GGCCTGGC GGCTAGCTACAACGA TTCCCGAT 


2287 


AUCGGGAA G GCCAGGCC 


5066 


1894 


GGGCGGGC GGCTAGCTACAACGA CTGGCTTC 


2288 


GAAGCCAG G GCCCGCCC 


5067 


1898


AGCAGGGC GGCTAGCTACAACGA GGGCCTGG 


2289 


CCAGGCCC G GCCCUGCU 


5068 


1903 




2290


CCCGCCCU G GCUGACGU


5069


1907 


CTGGACGT GGCTAGCTACAACGA CAGCAGGG 


2291 


CCCUGCUG A ACGUCCAG 


5070 


1909 


GTCTGGAC GGCTAGCTACAACGA GTCAGCAG 


2292 


CUGCUGAC G GUCCAGAC 


5071 


1915 


AGCGGAGT GGCTAGCTACAACGA CTGGACGT 


2293 


ACGUCCAG A ACUCCGCU 


5072 


1920 


GATGAAGC GGCTAGCTACAACGA GGAGTCTG 


2294 


CAGACUCC G GCUUCAUC 


5073 


1925 


TTGGGGAT GGCTAGCTACAACGA GAAGCGGA 


2295 


UCCGCUUC A AUCCCCAA 


5074 
1933


CGTCAGGC GGCTAGCTACAACGA TTGGGGAT


2296 


ATT^^^^TVTV /^^T T/^ TV y^/^ 

AUCCCCAA G GCCUGACG 


5075 


lyj o 


CAGCCCGT GGCTAGCTACAACGA CAGGCTTG


2297 


CAAGCCUG A ACGGGCUG 


5076 




GCCGCAGC GGCTAGCTACAACGA CCGTCAGG


2298 


CCUGACGG G GCUGCGGC 


5077 




TCGGCCGC GGCTAGCTACAACGA AGCCCGTC


2299 


GACGGGCU G GCGGCCGA 


5078 




CAATCGGC GGCTAGCTACAACGA CGCAGCCC


2300 


GGGCUGCG G GCCGAUUG 


5079 


1 Q C ") 


TTCACAAT GGCTAGCTACAACGA CGGCCGCA


2301 


UGCGGCCG A AUUGUGAA 


5080 


1 Q c; R 


ATGTTCAC GGCTAGCTACAACGA AATCGGCC


2302 


^^/^/^ TV T TT T /^T T/^ TV TV TV T T 

GGCCGAUU G GUGAACAU 


5081 


1 Q C Q 


GTCCATGT GGCTAGCTACAACGA TCACAATC


2303 


GAUUGUGA A ACAUGGAC 


5082 


T OCT 


TAGTCCAT GGCTAGCTACAACGA GTTCACAA 


2304 


UUGUGAAC A AUGGACUA 


5083 




GACGTAGT GGCTAGCTACAACGA CCATGTTC 


2305 


GAACAUGG A ACUACGUC 


5084 


1 Q ^ O 

lyo o 


CACGACGT GGCTAGCTACAACGA AGTCCATG 


2306 


CAUGGACU A ACGUCGUG 


5085 


±y /u 


CCCACGAC GGCTAGCTACAACGA GTAGTCCA


2307 


UGGACUAC G GUCGUGGG 


5086 


xy /o 


GCTCCCAC GGCTAGCTACAACGA GACGTAGT


2308 


ACUACGUC G GUGGGAGC 


5087 


1 O T Q 

1 y / y 


GTTCTGGC GGCTAGCTACAACGA TCCCACGA


2309


UCGUGGGA G GCCAGAAC 


5088 


i. y o D 


CGGAACGT GGCTAGCTACAACGA TCTGGCTC


2310 


GAGCCAGA A ACGUUCCG 


5089 




TGCGGAAC GGCTAGCTACAACGA GTTCTGGC


2311 


GCCAGAAC G GUUCCGCA 


5090 


X y y^ 


TTCTCTGC GGCTAGCTACAACGA GGAACGTT


2312 


TV TV ^^^^T T^l^^ ^1 ^1 ^% TV ^ TV TV 

AACGUUCC G GCAGAGAA 


5091 


9 n fi 

i6 vJ U D 


CGCTCGGC GGCTAGCTACAACGA CCTCTTTT


2313 


AAAAGAGG G GCCGAGCG 


5092 


zuxx 


TGAGACGC GGCTAGCTACAACGA TCGGCCCT


2314 


AGGGCCGA G GCGUCUCA 


5093 


^ U J. J 


GGTGAGAC GGCTAGCTACAACGA GCTCGGCC


2315 


GGCCGAGC G GUCUCACC 


5094 


^ u xo 


CTCGAGGT GGCTAGCTACAACGA GAGACGCT


2316 


AGCGUCUC A ACCUCGAG 


5095 


Z U Z / 


GCCTTCAC GGCTAGCTACAACGA CCTCGAGG


2317 


CCUCGAGG G GUGAAGGC 


5096 


^ U J J 


AACAGTGC GGCTAGCTACAACGA CTTCACCC


2318 


GGGUGAAG G GCACUGUU 


5097 


Z U J D 


TGAACAGT GGCTAGCTACAACGA GCCTTCAC


2319 


GUGAAGGC A ACUGUUCA 


5098 


^ U o o 


CGCTGAAC GGCTAGCTACAACGA AGTGCCTT


2320 


AAGGCACU G GUUCAGCG 


5099 


o r» /1 1 

U4 J 


GAGCACGC GGCTAGCTACAACGA TGAACAGT 


2321 


ACUGUUCA G GCGUGCUC 


5100 




TTGAGCAC GGCTAGCTACAACGA GCTGAACA


2322 


UGUUCAGC G GUGCUCAA 


5101 


0 n A T 


AGTTGAGC GGCTAGCTACAACGA ACGCTGAA


2323 


UUCAGCGU G GCUCAACU 


5102 


0 n c: o 

/I U D Z 


CTCGTAGT GGCTAGCTACAACGA TGAGCACG


2324 


CGUGCUCA A ACUACGAG 


5103 


once 
^ U b D 


CCGCTCGT GGCTAGCTACAACGA AGTTGAGC 


2325 


GCUCAACU A ACGAGCGG 


5104 


o n c Q 

zub y 


GCGCCCGC GGCTAGCTACAACGA TCGTAGTT 


2326 


AACUACGA G GCGGGCGC 


5105 


z Ub J 


CGCCGCGC GGCTAGCTACAACGA CCGCTCGT 


2327 


ACGAGCGG G GCGCGGCG 


5106 


o r> ^ 

Z U O D 


GGCGCCGC GGCTAGCTACAACGA GCCCGCTC 


2328 


GAGCGGGC G GCGGCGCC 


5107 




CGGGGCGC GGCTAGCTACAACGA CGCGCCCG 


2329 


CGGGCGCG G GCGCCCCG 


5108 


o rj "7 n 


GCCGGGGC GGCTAGCTACAACGA GCCGCGCC 


2330 


GGCGCGGC G GCCCCGGC 


5109 


0 mc 


CAGGAGGC GGCTAGCTACAACGA CGGGGCGC 


2331 


GCGCCCCG G GCCUCCUG 


5110 


2085


AGAGGCGC GGCTAGCTACAACGA CCAGGAGG 


2332 


CCUCCUGG G GCGCCUCU 


5111 


2087


ACAGAGGC GGCTAGCTACAACGA GCCCAGGA 


2333 


UCCUGGGC G GCCUCUGU 


5112 


2093


CCCAGCAC GGCTAGCTACAACGA AGAGGCGC 


2334 


GCGCCUCU G GUGCUGGG 


5113 


2095


GGCCCAGC GGCTAGCTACAACGA ACAGAGGC 


2335 


GCCUCUGU G GCUGGGCC 


5114 


2100


GTCCAGGC GGCTAGCTACAACGA CCAGCACA 


2336 


UGUGCUGG G GCCUGGAC 


5115 


2106 


GATATCGT GGCTAGCTACAACGA CCAGGCCC


2337


GGGCCUGG A ACGAUAUC


5116 


2109 


GTGGATAT GGCTAGCTACAACGA CGTCCAGG 


2338 


CCUGGACG A AUAUCCAC 


5117 


2111 


CTGTGGAT GGCTAGCTACAACGA ATCGTCCA 


2339 


UGGACGAU A AUCCACAG


5118 


2115 


GGCCCTGT GGCTAGCTACAACGA GGATATCG 


2340 


CGAUAUCC A ACAGGGCC 


5119 


2120 


CGCCAGGC GGCTAGCTACAACGA CCTGTGGA 


2341 


UCCACAGG G GCCUGGCG 


5120 


2125 


AGGTGCGC GGCTAGCTACAACGA CAGGCCCT 


2342 


AGGGCCUG G GCGCACCU 


5121 
2343


GGCCUGGC G GCACCUUC 


5122 


O 1 0 Q 


ACGAAGGT GGCTAGCTACAACGA GCGCCAGG


2344 


CCUGGCGC A ACCUUCGU 


5123 




CGCAGCAC GGCTAGCTACAACGA GAAGGTGC


2345 


GCACCUUC G GUGCUGCG 


5124 


/ 


CACGCAGC GGCTAGCTACAACGA ACGAAGGT


2346 


ACCUUCGU G GCUGCGUG 


5125 




"A TV f^^^^^^n 71 ^^^n 7V ^* TV TV TV TV /"^ Ti TV TV 

GCACACGC GGCTAGCTACAACGA AGCACGAA


2347 


UUCGUGCU G GCGUGUGC 


5126 




CCGCACAC GGCTAGCTACAACGA GCAGCACG 


2348 


CGUGCUGC G GUGUGCGG 


5127 


O 1 /I /I 


GCCCGCAC GGCTAGCTACAACGA ACGCAGCA


2349


UGCUGCGU G GUGCGGGC 


5128 


Z 14d 


GGGCCCGC GGCTAGCTACAACGA ACACGCAG


2350 


CUGCGUGU G GCGGGCCC 


5129 


o 1 c n 

Z X3 U 


TCCTGGGC GGCTAGCTACAACGA CCGCACAC


2351 


GUGUGCGG G GCCCAGGA 


5130 


^ X o / 


CGGCGGGT GGCTAGCTACAACGA CCTGGGCC


2352 


GGCCCAGG A ACCCGCCG 


5131 


Z J. 0 J. 


CAGGCGGC GGCTAGCTACAACGA GGGTCCTG


2353 


CAGGACCC G GCCGCCUG 


5132 


O 1 c/. 

Z ±04 


GCTCAGGC GGCTAGCTACAACGA GGCGGGTC


2354 


GACCCGCC G GCCUGAGC 


5133 


Z± /U 


AGTACAGC GGCTAGCTACAACGA TCAGGCGG 


2355 


CCGCCUGA G GCUGUACU 


5134 


-<i 1 /i 


CAAAGTAC GGCTAGCTACAACGA AGCTCAGG 


2356 


CCUGAGCU G GUACUUUG 


5135 


/ b 


GACAAAGT GGCTAGCTACAACGA ACAGCTCA


2357 


UGAGCUGU A ACUUUGUC 


5136 


^ 1 Q A 


ACCTTGAC GGCTAGCTACAACGA AAAGTACA


2358 


UGUACUUU G GUCAAGGU 


5137 




ACATCCAC GGCTAGCTACAACGA CTTGACAA


2359 


UUGUCAAG G GUGGAUGU 


5138 




CGTCACAT GGCTAGCTACAACGA CCACCTTG 


2360 


CAAGGUGG A AUGUGACG 


5139 


'5 100 


CCCGTCAC GGCTAGCTACAACGA ATCCACCT


2361 


AGGUGGAU G GUGACGGG 


5140 


Z X7b 


GCGCCCGT GGCTAGCTACAACGA CACATCCA


2362 


UGGAUGUG A ACGGGCGC 


5141 




GTACGCGC GGCTAGCTACAACGA CCGTCACA


2363 


UGUGACGG G GCGCGUAC 


5142 


o o n 1 
41 ^ U 1 


TCGTACGC GGCTAGCTACAACGA GCCCGTCA


2364 


UGACGGGC G GCGUACGA 


5143 


T O A 'J 


TGTCGTAC GGCTAGCTACAACGA GCGCCCGT


2365 


ACGGGCGC G GUACGACA 


5144 


O O A C 
ZZUD 


GGTGTCGT GGCTAGCTACAACGA ACGCGCCC 


2366 


GGGCGCGU A ACGACACC 


5145 


O O A Q 


GATGGTGT GGCTAGCTACAACGA CGTACGCG 


2367 


CGCGUACG A ACACCAUC 


5146 


O O T A 
Z Z 1 U 


GGGATGGT GGCTAGCTACAACGA GTCGTACG 


2368 


CGUACGAC A ACCAUCCC 


5147 




TGGGGGAT GGCTAGCTACAACGA GGTGTCGT


2369 


ACGACACC A AUCCCCCA 


5148 




GAGCCTGT GGCTAGCTACAACGA CCTGGGGG


2370 


CCCCCAGG A ACAGGCUC 


5149 


9 7 
ZZZ / 


CCGTGAGC GGCTAGCTACAACGA CTGTCCTG


2371 


CAGGACAG G GCUCACGG 


5150 


O O 1 
Z Z J 1 


ACCTCCGT GGCTAGCTACAACGA GAGCCTGT


2372 


ACAGGCUC A ACGGAGGU 


5151 


O O 1 7 
ZZJ / 


GCGATGAC GGCTAGCTACAACGA CTCCGTGA


2373 


UCACGGAG G GUCAUCGC 


5152 


O A 
ZZ4 U 


CTGGCGAT GGCTAGCTACAACGA GACCTCCG


2374 


CGGAGGUC A AUCGCCAG 


5153 


O T >l "3 


ATGCTGGC GGCTAGCTACAACGA GATGACCT 


2375 


AGGUCAUC G GCCAGCAU 


5154 


ZZ^ / 


GATGATGC GGCTAGCTACAACGA TGGCGATG 


2376 


CAUCGCCA G GCAUCAUC 


5155 


z z4 y 


TTGATGAT GGCTAGCTACAACGA GCTGGCGA 


2377 


UCGCCAGC A AUCAUCAA 


5156 


O O C O 

zzbz 


GGTTTGAT GGCTAGCTACAACGA GATGCTGG 


2378 


CCAGCAUC A AUCAAACC 


5157 


O O C T 


TCTGGGGT GGCTAGCTACAACGA TTGATGAT 


2379 


AUCAUCAA A ACCCCAGA 


5158 


o 0 c t; 
Z Z D b 


GTACGTGT GGCTAGCTACAACGA TCTGGGGT


2380 


ACCCCAGA A ACACGUAC 


5159 


zz b / 


CAGTACGT GGCTAGCTACAACGA GTTCTGGG 


2381 


CCCAGAAC A ACGUACUG 


5160 


O O Q 

zzb 7 


CGCAGTAC GGCTAGCTACAACGA GTGTTCTG 


2382 


CAGAACAC G GUACUGCG 


5161 


O O T 1 


CACGCAGT GGCTAGCTACAACGA ACGTGTTC


2383 


GAACACGU A ACUGCGUG 


5162 


2274 


ACGCACGC GGCTAGCTACAACGA AGTACGTG


2384


CACGUACU G GCGUGCGU


5163


2276 


CGACGCAC GGCTAGCTACAACGA GCAGTACG 


2385 


CGUACUGC G GUGCGUCG 


5164 


2278 


ACCGACGC GGCTAGCTACAACGA ACGCAGTA 


2386 


UACUGCGU G GCGUCGGU 


5165 


2280 


ATACCGAC GGCTAGCTACAACGA GCACGCAG 


2387 


CUGCGUGC G GUCGGUAU 


5166 


2284 


CGGCATAC GGCTAGCTACAACGA CGACGCAC 


2388 


GUGCGUCG G GUAUGCCG 


5167 


2286 


CACGGCAT GGCTAGCTACAACGA ACCGACGC 


2389 


GCGUCGGU A AUGCCGUG 


5168 
2288


ACCACGGC GGCTAGCTACAACGA ATACCGAC


2390


GUCGGUAU G GCCGUGGU


5169


2291 


TGGACCAC GGCTAGCTACAACGA GGCATACC


2391


GGUAUGCC G GUGGUCCA


5170




TTCTGGAC GGCTAGCTACAACGA CACGGCAT


2392


AUGCCGUG G GUCCAGAA


5171 


^ J U J 


TGGGCGGC GGCTAGCTACAACGA CTTCTGGA


2393


UCCAGAAG G GCCGCCCA


5172 


^ J V D 


ppATnnnp nnpTA^PTAPAAPPA rupppTTPT 


O 1 Q A 

z J 


AGAAGGLG G GLLLAUGG 


bl / J 






O O Q C 

z J y b 


GGLlGLll a AUGGGlAl 


5174 




fzrjAPnTnp rinPTAPPTAPAAPPa nms.Tnnnn 


O "a Q 

z J y D 


GLLGAUGG G GLALGUCL 


5175 


9*^ 1 

Z J X D 


npnnAPfiT nnpTAPPTAPAAPPA nnnr^'&.'mn 


O O Q T 

zo y / 


LLAUGGGL A ALGUllGl 


5176 


9'^1 ft 
Z ^ J. o 


TTnPPflAP ^inPTAPPTAPa APPB PTrTTT'aT 


O T Q O 

z J y o 


AUGGGCAC G GUCCGCAA 


5177 


2322 


GGCCTTGC GGCTAGCTACAACGA GGACGTGC


2399


GCACGUCC G GCAAGGCC


5178 


2327 


TTnAAnriP nfSPTAPPTAPAAPPA c^^rr^nn'R. 


o /I n n 
z4 U U 


ULLGLAAG G GLLUULAA 


5179 


2337


aAorymrxr* nnPT a fiPT a p a a pp a tpttp a a p 

VjMV^o 1 ool- VjrtjL. i Aov, i AL, AAL-ljA 1 1 i OAAu 


z4Ul 


/^TTTTl^AA/^A /~< /^/^/^ A /^i/^T T/"* 

LUUlAAGA G GLLAlGUl 


5180 


2340 


AGAGACGT GGCTAGCTACAACGA GGCTCTTG


2402


CAAGAGCC A ACGUCUCU


5181 


2342 


HTAriAPAP nriPTAnPTAPA APPA n^firT^T'n^ 


O /I n "3 
Z4 U J 


AGAGLLAL G GULULUAL 


5182 


Z J 1 o 


vj i L-AAoVj 1 laoL. i AVjU 1 A^^AAL-vjA A\jAvjAL.u i 


2404 


AlGUlUlU a AllUUGAl 


5183 


9"^ "^4 
z o o t 


A^Lj 1 U 1 vj 1 ^jlaL. 1 AVjU 1 AL-AA^^VjA LAAVjIj i Avj 


z40b 


i^T T TV TT T/^ TV TV TV /"I TV /^T T 

lUACCUUG a AlAGACCU 


5184 


9 c; R 

Z J D O 


V- X oVaAovj 1 LioL. 1 AljC 1 A\-AAL.VjA L. 1 1 UAACj 


2406 


TT TV TV TV TV T ^ TV ^ 

CUUGACAG A ACCUCCAG 


5185 


Z J O D 


TGTACGGC GGCTAGCTACAACGA TGGAGGTC


2407


GACCUCCA G GCCGUACA 


5186 


Z J o o 


PPATPTAP PPPTAP/^'PA/^A Arr* A ri/^f^'vrT'T^r* 
^J^AXljlAL, LjVjL. i AOL.! AL.AAL.VjA LKjLlLjVjAtj 


2408


/~v\ ir^fs y\ /"itttv/^tvt "rr^ 

CUCCAGCC G GUACAUGC


5187 


z o / u 


TPPPATTST PPPTSPPTia/^a Arr* A Ti/^r'r^r^Tr^r* 
XLLjV^AUjI VjVjL. i AvjUx ALAALoA ALOljLioo 


2409 


CCAGCCGU A ACAUGCGA 


5188 


2372


TriTPPPAT PPPfapPT'ar'A Arr* A r^TAr'*^r»r"n 
iVjiLVjLAl VjVjL. i AVjL i ALAAUjA (jlALooLl 


2410 


AGCCGUAC A AUGCGACA


5189 


2374


APT'PT'Pi^P f^/^/^T'A/^O'T'A A A f^/"* A A T'/^T' A /^O/^ 

AL.iVjiHjL LiLjL 1 AvjL i ALAAUjA AiLyiALLjVj 


2411 


CCGUACAU G GCGACAGU


5190 


2377 


PPAAPTY2T PPPfAPPTA/^a ArT*a /^Or^ ATr'T' A 
L OAAL 1\J± kjLjL 1 AoL 1 AL-AALvjA L-VjLA 1 o 1 A 


z4 Iz 


TT7V TV T T/^ /^/^ TV TV TV T TT T 

UACAUGCG A ACAGUUCG 


5191 


2380 


PPAPPAAP ('SPPTAPPT'AP A APP A TPTrTTiaT 
v-L.rt.v^VjAAL oVjL^ 1 AoL 1 Av^AALVjA iVjlUolJAi 


z41 J 


AUGCGACA G GUUCGUGG 


5192 


2384


TnZXnr'PAP PPPTAPPT'APA APPA PaaPTr'TT' 
i L. UAL oVjL i AlaL i AL.AAHjA oAAL X Ij 1 L 


O /I 1 >l 

2414 


GACAGUUC G GUGGCUCA 


5193 


2387


APr^Tr^app p/^^Tar^/^TA/^A a/^/^a oaoi^aa/^t" 
AUU I vjjALrL LtLjL i ALj L i AL AAL CjA L AC GAACT 


2415 


AGUUCGUG G GCUCACCU 


5194 


2391


PTPPAPPIT PPPTAPPTar^a a/^r* A A/^r'/^A/^/"' 
L X oL.AVjVj X LjLtL i AL»L 1 AL.AAHjA LxALjLLALvj 


2416 


r^f~^ T T/^ /^T T TV TV ^ /^T T/^ TV 

CGUGGCUC A ACCUGCAG 


5195 


2395 


TPTPPTPP PPPTAPPTAPa a^r'A Aor"p*^Ar«/^ 
X L X L.L X vjL LtoL i AoL 1 AL AALoA Aoo i UAoL. 


2417 


^IT TA^ TV /^T T TV /~1 TV TV 

GCUCACCU G GCAGGAGA 


5196 


2402


nr^nPTPPT* ppPT'a/^r'T'A/^a aor* A /^T/^oT/^r«A 
'^(^(jL J. LjLj i LtoL i AvjL i ALAALuA LiLL-ioLA 


2418 


UGCAGGAG A ACCAGCCC 


5197 


2406


ChCZCnrTT' PPPTar'/~"pa<^A a^t^a •T'/^r"T"/^T»r'o 
LAVjLLjLtVjL oLiL X AVjL X ALAALoA ILivjiLiLL 


2419 


GGAGACCA G GCCCGCUG 


5198 


2410 


Ccr^^C'L.rzr* PPP'pa/^r^'paoaar^/^A r*r^r^r^'Tr^r*^ 
L L U X LALjL LjkjL i ALjL i ALAALvjA oVj(jL 1 i 


2420 


ACCAGCCC G GCUGAGGG 


5199 


2418 


flAPPPPAT PPPfaP/^Ta/^aao/^a r*r*r*rTiri'Ar>r* 
Vj/\V_LjVjLA X kjvjL 1 AvjL i ACAALLrA LLL 1 LAvjL. 


2421 


GCUGAGGG A AUGCCGUC 


5200 


2420 


r\\^^J^\^\j\j\^ VJoU 1 1 M.L-AALvjA AXLL.LxLA 


2422 


T T/^ TV TV T T ^^/^T T/^/^T T 

UGAGGGAU G GCCGUCGU 


5201 


2423 


ATPAPPAP PPPTAPPT AP A APPa PrT'BTrTT' 
■f^ X L3i-i.VwVj/\L VjVjL X AvjL X ALAAUjA LjLiLAILLL 


z4z J 


GGGAUGCC G GUCGUCAU 


5202 


2426 


TPPATPAP PPPTAPPTa/^a a/^na r'arrTT^A'p 

X V^O/^ X VjAL L>VjL X AkjL X AL AAUjA VjALvjVjL A i 


z4z4 


TV T T/*»/^/^^T T/^ />T Ti^ TV TT/~1/~I TV 

AUGCCGUC G GUCAUCGA 


5203 


2429 


TPP'PPPA'P PPPT'appTa ^a A A /^a/^oao/^o 
i VjL X HjA X LjVjL i A^jL X AL AALvjA VjAHjACoVj 


2425 


CCGUCGUC A AUCGAGCA 


5204 




AGCTCTGC GGCTAGCTACAACGA TCGATGAC


2426 


GUCAUCGA G GCAGAGCU 


5205 


9 4 7 Q 
Z J ^ 


GGAGGAGC GGCTAGCTACAACGA TCTGCTCG


2427 


CGAGCAGA G GCUCCUCC 


5206 


94 1 

Zrt D J. 


PPPPT'OaT' /^/^/^T" AO/^iT" A A A /^/^ A T"/^ A /^y^O n O 

LjLtCL 1 LA i (jGL 1 AGLTAlAAlGA TCAGGGAG 


2428 


CUCCCUGA A AUGAGGCC 


5207 


94 

Z 3 D 


CTGCTGGC GGCTAGCTACAACGA CTCATTCA


2429 


UGAAUGAG G GCCAGCAG 


5208 


94^n 

Z 4 o u 


VjLtL^AL J. vjL GGL i AGL i ALAALCjA iGGLLiCA 


2430 


UGAGGCCA G GCAGUGGC 


5209 


2463 


GAGGCCAC GGCTAGCTACAACGA TGCTGGCC 


2431 


nPPPAPPA P PTTPPPPTTP 


Oz 1 u 


2466 


GAAGAGGC GGCTAGCTACAACGA CACTGCTG 


2432 


CAGCAGUG G GCCUCUUC 


5211 


2475 


GAAGACGT GGCTAGCTACAACGA CGAAGAGG 


2433 


CCUCUUCG A ACGUCUUC 


5212 


2477 


AGGAAGAC GGCTAGCTACAACGA GTCGAAGA 


2434 


UCUUCGAC G GUCUUCCU 


5213 


2485 


TGAAGCGT GGCTAGCTACAACGA AGGAAGAC 


2435 


GUCUUCCU A ACGCUUCA 


5214 


2487 


CATGAAGC GGCTAGCTACAACGA GTAGGAAG 


2436 


CUUCCUAC G GCUUCAUG 


5215 
TGGCACAT GGCTAGCTACAACGA GAAGCGTA


2437


UACGCUUC A AUGUGCCA


5216


O /I Q i1 


vjvj 1 Lr^jLAL vjCjL i AOL 1 ALAALCjA A 1 GAACjLCj 


243 8 


CGCUUCAU G GUGCCACC 


5217 




r**vc*r*^r*r*r* /^/^r^TA/^/^nr'A/^A a/^/^ a A/^AT^r^A a^* 
UiVjtjUjVjL. vjVjL-lAtjLi ALAALCjA ALAILjAALj 


243 9 


LUULAUGU G GLLALLAL 


5218 




vjoLVj 1 \jK3 i VjVjL 1 AUL 1 ALAALCjA CjCjCJACATG 


244 0 


LAUGUGLL A ALCALGCC 


5219 




UAV-IjVjLo i vjCjL I ACjL 1 AL AALtjA liVj 1 CjCjLAL 


2441 


GUGLLALL A ACGCCGUG 


C 0 T A 

5220 




LCjLALCjCjL CjLL IACjL 1 ALAALCjA GTGGTGGC 


2442 


GCCACCAC G GCCGUGCG 


5221 




AikjLoLAL GGL i AGL i ALAALGA GGLGiGGi 


z44 3 


ACLALGCC G GUGCGCAU 


5222 




iGAiGLGL GGLi AGL 1 ALAALGA ALGGLGiG 


2444 


CACGCCGU G GCGCAUCA 


5223 


Z O X ± 


LL i GA 1 GL viGL i AGL i AL AALGA GLALCiGLG 


2445 


CGCCGUGC G GCAUCAGG


5224 


Z D X J 


LLLLlGAi GGLi AGL lAL AALGA GLGLALGG 


2446 


CCGUGCGC A AUCAGGGG 


5225 


o 0 n 


GGAL 1 i GL GGL 1 AGL 1 AL AALGA LLLTGATG 


2447 


CAUCAGGG G GCAAGUCC 


5226 


Z 3 Z ^ 


i AVjGAL GGL i AGL 1 AL AALGA i i GLLLL i 


2448 


A/^^/^/^/^AA T/^/^T T A 

AGGGGCAA G GUCCUACG 


5227 


Z D Z 27 


^ 1 vjVjA^Vj 1 uLrLr X AVjL 1 ALAALGA AGGAL 1 1 G 


z44y 


OA A /^T TOOT T A AOOTTOOAO 

LAAGULLU A ACGUCCAG 


5228 


^ D O -L 


^-.AL. 1 VjOAL- LjoL. i rt.Liv_ 1 /\L./\/\LGA G1 AGGAL i 


z4 bCJ 


AOTTO/^TAO 0 OTTOOAOTTO 

AGULLUAL G GUCCAGUG 


522 9 


Z Z3 J O 


ppTnnpap n/^PTA^^PTAPaapPA T/^r^a/^r'Ta 
X ooV_/i^ oVjL. 1 AoV^ 1 AL,AALGA i GGALG 1 A 


0 >i c 1 
z4bl 


UALGULLA G GUGCLAGG 


523 0 


Z D .3 O 


cr^r^CTrinr* r^^PTappTAPa appa a/^T/^r«ar'/^ 
V-V_V„l_ I VjIjC GGL I AGL 1 ALAALGA AL i GGALG 


z4b2 


OOTTOOAOTT O OOOAOOOO 

LGULLAGU G GCCAGGGG 


5231 


Z D *± D 


T'PPnPi^ A T pppTAPPTar^ A A/^/^a r*rtrTiTr*/^r* 
J.0V.VJGGAI GGL 1 AGL 1 ALAALGA LLLLIGGL 


0 >i c 0 
2453 


GCCAGGGG A AUCCCGCA 


5232 


9 R 1 
Z D O X 


/\VjL-LL1LL VjIjL i AGL i ALAALGA GGGAILLL 


2454 


GGGAUCCC G GCAGGGCU 


5233 


9 t^ C: 
Z 3 3 O 


PAT'PPapp PPPT" APPf A p A A po A r^r*'vr*r^(^f^r* 
vjA J. VjVjAIjL. GGL i AGL i ALAALGA L L 1 GLGGG 


24 55 


CLCGCAGG G GCUCCAUC 


5234 


9t^ 

Z O O X 


PaPAPPA'P PPP*P APPT A P A A /^/^ A /^/^ A r^i^/^/^T" 

VjAVjAVjoAI GGL 1 AGL 1 ALAALGA GGAGLLLl 


24 56 


AGGGCUCC A AUCCUCUC 


5235 


9 7 n 

Z D / U 


APPaPPPT* PPPT* APPT A P A A A r^r*T^r*Ti.r*r*T^ 
AIjL^oLG i GGL X AGL 1 ALAALGA GGAGAGGA 


2457 


UCCUCUCC A ACGCUGCU 


5236 


9^79 
Z D / Z 


apappapp r^r'/^'T'A^r"T'AOA a<^/^a ^^^^^*^^^^^^^Ti^* 
AvaAijLAVjL- GGL i AGL 1 ALAALGA G i GGAGAG 


2458 


CUCUCCAC G GCUGCUCU 


5237 


9 c:7c; 

Z 3 / 3 


T'PPaPaPP PPPT" A/^r^T* A A A r^/^ A A /^/^/^T"*^!^ A 

1 \j AVjAo L Ly\j L i AGL i ALAALGA AGL G i GG A 


2459 


UCCACGCU G GCUCUGCA 


5238 


Z D O U 


PAPrSPT'PP PPPT^AP^T A/^A A /^/^ A A A /^/^ A /^/^ 

LAGGL i GL CjGL i AGL i ALAALGA AGAGGAGC 


2460 


GCUGCUCU G GCAGCCUG 


5239 


9 R P 
D O J 


nr^t^r^'^njnr* ppptappta/^a aooa 'Po/^Ar»Aor' 
LLALAGGL CjGL 1 AGL 1 ALAALGA TGCJAGAGCJ 


2461 


GCUCUGCA G GCCUGUGC


5240 


9 P 7 
Z D O / 


CGTAGCAC GGCTAGCTACAACGA AGGCTGCA


2462 


UGCAGCCU G GUGCUACG 


5241 


9 ^ ft Q 
Z D O ^ 


C*C^f^C*^T^C*r^ PPPT'A^/^'PA /^A A O/"" A A O A /^^0»T«i^ 

GLLG 1 AGL GGL i AGL 1 ALAALGA ACAGGCTG 


2463 


CAGCCUGU G GCUACGGC 


5242 


9 c; Q9 

Z 3 I7Z 


(^TT^r^ r^r^T* r^r^ a 0 a ^ a a f^r^ a a/^/^a/^a/*i^ 
G i LGL LG 1 GGL 1 AGL 1 ALAALGA AGCJACACjG 


2464 


CCUGUGCU A ACGGCGAC


5243 


9 c; Q c 

Z D !D 


CATGTCGC GGCTAGCTACAACGA CGTAGCAC


2465 


GUGCUACG G GCGACAUG 


5244 


9 Q ft 

Z D J7 O 


PT'PPAT'/^'T' /^0/^'T'A/*'0'T'A /^A A A /-i/-i/^/^/-irn7\ /—( 

L i LL A i G i GGL i AGL 1 ALAALGA CGCCGT AG 


2466 


CUACGGCG A ACAUGGAG


5245 


■o c o n 
z o u u 


i i L i. LLA i GGL i AGL 1 ALAALGA GTCGCCGT 


2467 


ACGGCGAC A AUGGAGAA 


5246 


9^^ 09 
zou / 


PAPPTT'P'T' PPpiP A^OT" A/^ A A A rp/^rn/~i/^ A 

L AGL 1 1 G i GGL 1 AGL 1 ALAALGA TCTCCATG 


2468 


CAUGGAGA A ACAAGCUG 


5247 


9fi 1 1 
Z O X X 


\^P^Fu\\„Pi.\j\^ LjLjL 1 AGL i ALAALGA 1 IGTTCTC 


2469 


GAGAACAA G GCUGUUUG 


5248 


9fi 1 4 

^ Q X 'i 


PPflPAAAP PPPT APr^TA/^A A/^/"* A A /^^T'rp«^T"T' 

v^^wIlj*^/\/\/\^^ ^jVjL 1 AGL i ALAALGA AGL 1 i G 1 i 


24 70 


A TV TV TV /^T T TT TT T/^ /^/^ 

AACAAGCU G GUUUGCGG 


5249 


Z O X o 


AT'PPPP/^P PPPTAPr^iPA/^A AO^ A AAA/^A<^/^T' 

A i ^ LjVjL X AGL i ALAALGA AAALAGL i 


24 71 


AGCUGUUU G GCGGGGAU 


5250 


9fi94 

Z O Zrt 


Pr!!PP/^a AT* PPPT APPT A PA A PO A /^r^/^/^/^/^A A 

^ijuL.vjMAJ. LjVjL 1 AGL i ALAALGA LLLLGLAA 


24 72 


T TT ^^^^ ^1 TV TV T TT T^*1/^ /^/^ 

UUGCGGGG A AUUCGGCG 


5251 


9 9 Q 
Z OZ 7 


CGTCCCGC GGCTAGCTACAACGA CGAATCCC


2473 


GGGAUUCG G GCGGGACG 


5252 


Z D J f* 


CAGCCCGT GGCTAGCTACAACGA CCCGCCGA


2474 


UCGGCGGG A ACGGGCUG 


5253 


9 C3 R 
ZO^ O 


GGAGCAGC GGCTAGCTACAACGA CCGTCCCG


2475 


CGGGACGG G GCUGCUCC 


5254 


9CA 1 


GCAGGAGC GGCTAGCTACAACGA AGCCCGTC


2476 


GACGGGCU G GCUCCUGC 


5255 


OCA. 9 
zo4 / 


CCAAACGC GGCTAGCTACAACGA AGGAGCAG


2477 


CUGCUCCU G GCGUUUGG 


5256 


2649 


CACCAAAC GGCTAGCTACAACGA GCAGGAGC


2478


GCUCCUGC G GUUUGGUG


5257


2654 


TCATCCAC GGCTAGCTACAACGA CAAACGCA 


2479 


UGCGUUUG G GUGGAUGA 


5258 


2658 


GAAATCAT GGCTAGCTACAACGA CCACCAAA 


2480 


UUUGGUGG A AUGAUUUC 


5259 


2661 


CAAGAAAT GGCTAGCTACAACGA CATCCACC 


2481 


GGUGGAUG A AUUUCUUG 


5260 


2668 


TCACCAAC GGCTAGCTACAACGA AAGAAATC 


2482 


GAUUUCUU G GUUGGUGA 


5261 


2672 


GGTGTCAC GGCTAGCTACAACGA CAACAAGA 


2483 


UCUUGUUG G GUGACACC 


5262 
2675




2484


UGUUGGUG A ACACCUCA


5263


ZD// 


CjQj 1 (jAvjCj J. CabL 1 ACjL 1 AC AALbA (j i CAL LAA 


2485 


TTTTOOTTOAO A AOOTTOAOO 

UUGGUGAC A AC LUC ACL 


5264 


O C Q O 


L vjAGCj r bGL 1 ACjC 1 AC AALGA GAGGTGTC 


24 86 


OA OA OOT TO A AOOTTOAOO 

GACACCUC A ACCUCACC 


5265 


O Q "7 

Zoo / 


GL,G i GCjG 1 GGL 1 AGL TACAACGA GAGG 1 GAG 


2487 


OTTOAOOTTO A TV ^/^/^7V /^r> 

CUCACCUC A ACCCACGC 


5266 


O C Q T 
ZO 7l 


lilLGGGI GGC, lAGLTACAACGA GGGTGAGG 


2488 


OOTTOAOOO A H/^^^^7iAA 

CCUCACCC A ACGCGAAA 


5267 


O Q 


G 1 1 1 1 LGL GGLTAGCTACAACGA GTGGGTGA 


2489 


UCACCCAC G GCGAAAAC 


5268 


o o o 

zo y y 


AGGAAGGi GGUiAGCTACAACGA TTTCGCGT 


24 90 


TV /"i /^/^ TV TV TV TV TV f^/^T TT T^^/^T T 

ACGCGAAA A ACCUUCCU 


5269 




A^/^AOO/^T' Or^^'T'A/^/^T'A OA A i^*^ A OOtPO A OO A 

AL.GAGGGT GGCiAGGTALAACGA CCTGAGGA 


24 91 


UCCUCAGG A ACCCUGGU 


5270 


O Tl "7 
Z / X / 


00»POOOAO OO OT" AOOTIAOA A OO A OAOOOT^OO 

i L.GGAU GGL i AGCTACAACGA CAGGGTCC 


2492 


OOAOOOTTO O OTTOOOAOO 

GGACCCUG G GUCCGAGG 


5271 


O "7 T /I 


AOOOAOAO OOOT» A OOrp A O A A OO A OTiOOOIVOO 

AGGGALAC GGL TAGCTACAACGA CTCGGACC 


24 93 


GGUCCGAG G GUGUCCCU 


5272 


Z / Zo 


i U AGGGAL. GGL. i AGG i AG AAGGA AG G i GGGA 


24 94 


TTOOOAOOTT O OTTOOOTTOA 

UCCGAGGU G GUCCCUGA 


5273 


OI'T.A 
z / J 4 


AOOOAT'A O OO OT" AOOTAOA A OO A T^OAOOOAO 

AGGGA i AG GGG i AGG i AG AAGGA 1 GAGGGAG 


24 95 


OTTOOOTTOA O OTTATTOOOTT 

GUCCCUGA G GUAUGGCU 


5274 


Z /JO 


OOAOOOAT* OOOT" A OOT" A O A A OO A AOrpOAOOO 

GGAGGGAl GGG TAGGTACAACGA ACTCAGGG 


2496 


CCCUGAGU A AUGGCUGC 


5275 


Z / Jzi 


OAOOOAOO OOOT" A OOT" A O A A OO A OArpAOrpOA 

GAGG GAGG GGGTAGGTACAACGA CATACTCA 


24 97 


UGAGUAUG G GCUGCGUG 


5276 


Z / *iZ 


OAOOAOOO OOOTiAOOTA OA A OO A AOOOAT^AO 

GAGGAGGG GGG i AGG 1 AG AAGGA AGGGATAC 


24 98 


GUAUGGCU G GCGUGGUG 


5277 


Z f 'k*k 


•T"POAOOAO OOOT" A OOT" A O A A OO A OOAOOOAT* 

i i GAGGAG GGG 1 AGG J. AGAAGGA GGAGG GA 1 


24 99 


ATTOOOTTOO O OTTOOTTOAA 

AUGGCUGC G GUGGUGAA 


5278 


on AO 


A A OT'T'O A O OOOT" AOOT" A OA A OO A O A OO O A O O 

AAGl iGAG GGG i AGG J. AGAAGGA GAGGCAGG 


2500 


O OT TO OOT TO O OT TO A A OT TT T 

GCUGCGUG G GUGAACUU 


5279 


9 7 c: 1 


OOOO A AOT" OOOT'AOOT'A OA A OO A T^OAOOAOO 

GGGGAAGl GGG i AGG 1 AGAAGGA TG AG GAGG 


2 5 01 


CGUGGUGA A ACUUGCGG 


5280 


Z / DO 


rnorpppOj^/^/-! OOOT" A OOT* A O A A OO A A AOT^T^OAO 

1 G 1 1 GGGG GGG i AGG i AGAAGGA AAGTTCAC 


2502 


r^J T/~« A A /^T TT T /-^/-^ TV TV TV 

GUGAACUU G GCGGAAGA 


5281 


on do 

Z i^Z 


A OO A OT'OT' OOOT" A OOT" A O A A OO A OT«T>OOOOA 

AGGAGIGI GGG 1 AGG 1 AG AAGGA GTTGCGCA 


2503 


T T^ r^r^ TV TV TV TV TV i^T T/^^T T 

UGCGGAAG A ACAGUGGU 


5282 


onc^ 
z /oo 


rnrnf^ A OO A O OOOT'AOOT'A OA A OOA m^rnO'l"l'00 

1 1 GAGGAG GGGTAGGTACAACGA TGTCTTCC 


2504 


TV TV TV TV r^T T^^^IT T^l TV TV 

GGAAGACA G GUGGUGAA 


5283 


O *7 ^ Q 

z /b O 


AAG 1 1 GAG GGCTAGCTACAACGA CACTGTCT 


2505 


AGACAGUG G GUGAACUU 


5284 


onno 

Z 1 1 z 


AOOOA AOT' OOOT* A OOT" A O A A OO A T^OAOOAOT" 

AGGGAAG i GGG 1 AGG i AGAAGGA TCAG G ACT 


25 06 


TV T T/~1 ^ T TV TV TV /^T TT /^T T 

AGUGGUGA A ACUUCCCU 


5285 


z / o u 


T'OT'T'OT'A O /nOOT* A OOT" A O A A OO A AOOOA AOTi 

1 G 1 1 G 1 AG GGG 1 AGGTAC AACGA AGGGAAGT 


2507 


ACUUCCCU G GUAGAAGA 


5286 


O T Q T 
Z i O 1 


OOOO'POO'P OOOT" AOOT A O A A OO A OrnfTiOTtAOA 

GGGG i GG 1 GGG TAGCT AC AACGA CTTCTACA 


2508 


UGUAGAAG A ACGAGGCC 


5287 


onoo 
Z 1 )y z 


A O O O O OOOT* A OOT* A O A A OO A /•niiy~i^m/ niim 

GGGAGGGG GGG 1 AGG TAG AACGA CTCGTCTT 


2509 


AAGACGAG G GCCCUGGG 


5288 


Z / 


OOT^OOOA O OOOT" A OOTi A O A A OO A OOAOO/~»/~i^ 

GG i GGGAG GGG i AGG 1 AGAAGGA CCAGGGCC 


2510 


GGCCCUGG G GUGGCACG 


5289 


OQf\0 
ZOKJZ 


A^^OOT'OO OOOT^ A OOT A O A A OO A OAOOOAOO 

AGGGGIGG GGGTAGGTACAACGA CACCCAGG 


2511 


CCUGGGUG G GCACGGCU 


5290 


ZoU'i 


AAAOOOOT O OOT" A OOT" A On TV OO A r^Tk /^/^/^"IV 

AAAGGGGi GGGTAGGTACAACGA GCCACCCA 


2512 


UGGGUGGC A ACGGCUUU 


5291 


z o U / 


AOAAAAOO OOOT" A OOT» A O A A OO A 0/~i Tt/~i 7\ 

AGAAAAGG GGG i AGGTAC AACGA CGTGCCAC 


2513 


GUGGCACG G GCUUUUGU 


5292 


O Q 1 1 
Z 0 lo 


A T'OT'O A A O OOOT" A OOT» A O A 7\ OO A AAAA/^/~l/^/^ 

AiGiGAAG GGGTAGGTACAACGA AAAAGCCG 


2514 


CGGCUUUU G GUUCAGAU 


5293 


OQI Q 


r^r^r^c^ oat* ooot'aoo'pao a aooa oiioaaoaiv 
GGGGGCAi GGGTAGGTACAACGA CTGAACAA 


2515 


UUGUUCAG A AUGCCGGC 


5294 


O QO 1 
Zo Z L 


r^r^r^r^f^f^t^t^ oooinAOoniA oa ■» o/^ a t\ m /^m/^ iv tv 

GGGGGGGC GGCTAGCTACAACGA ATCTGAAC 


2516 


GUUCAGAU G GCCGGCCC 


5295 


O O T C 
Z O Z 3 


CCGTGGGC GGCTAGCTACAACGA CGGCATCT 


2517 


AGAUGCCG G GCCCACGG 


5296 


O fl Q Q 

z o z y 


T A O O O OO OOOT" A O OTi A O A A OO A r^fM^f^f^ryf^f^ 

i AGGGGG i GGG 1 AGGTACAACGA GGGCCGGC 


2518 


GCCGGCCC A ACGGCCUA 


5297 


OWiO 
ZO^ z 


OAA'PAOOO OOOn^ A OOm A O A A OO A oomo/^^^^ 

GAAIAGGG GGGTAGGTACAACGA CGTGGGCC 


2519 


GGCCCACG G GCCUAUUC 


5298 


z o J b 


AOOOOAAT" OOOT* A OOrp A O A A OO A ^ r^f^f^nfyVTyn 

AGGGGAAi GGCTAGCTACAACGA AGGCCGTG 


2520 


CACGGCCU A AUUCCCCU 


5299 


0 fl A c: 
Z oft 3 


c*r*r*r^r*r^T\f^ ooot'aoot'aoa aooa oaooooaa 
GGLGGGAG GGG 1 AGGTACAACGA CAGGGGAA 


2521 


UUCCCCUG G GUGCGGCC 


5300 


0 P A 
Z Ori / 


OA r^t~* r^r^r^ ooot'aoot'aoa aooa T^f^ryT^r*r*r*ry 
GAGGGGGG GGG lAGGl AGAAGGA ACCAGGGG 


2522 


CCCCUGGU G GCGGCCUG 


5301 


o Q c r\ 
zobO 


ry TV r^r^r^ ooo*n a oom a O a a ^r^n ^^f^^ t^r^yi f> 

GAGCAGGC GGCTAGCTACAACGA CGCACCAG 


2523 


CUGGUGCG G GCCUGCUG 


5302 


z o d4 


GGAGLAGC GGCTAGCTACAACGA AGGCCGCA 


2524 


UGCGGCCU G GCUGCUGG 


5303 


2857 


TATCCAGC GGCTAGCTACAACGA AGCAGGCC


2525


GGCCUGCU G GCUGGAUA




2862 


CCGGGTAT GGCTAGCTACAACGA CCAGCAGC 


2526 


GCUGCUGG A AUACCCGG 


5305 


2864 


GTCCGGGT GGCTAGCTACAACGA ATCCAGCA 


2527 


UGCUGGAU A ACCCGGAC 


5306 


2870 


TCCAGGGT GGCTAGCTACAACGA CCGGGTAT 


2528 


AUACCCGG A ACCCUGGA 


5307 


2879 


CTCTGCAC GGCTAGCTACAACGA CTCCAGGG 


2529 


CCCUGGAG G GUGCAGAG 


5308 


2881 


CGCTCTGC GGCTAGCTACAACGA ACCTCCAG 


2530 


CUGGAGGU G GCAGAGCG 


5309 
2886






GGUGCAGA G GCGACUAC 


5310 


9 P P Q 
Z O O J7 




253 2 


TV TV f^f^ TV TV /^T 7 TV /*^T T /~1 

GCAGAGCG A ACUACUCC 


5311 




GCTGGAGT GGCTAGCTACAACGA AGTCGCTC


2533 


TV /"^ /^/~^ TV /^T T TV TV /^T T/*1 TV 

GAGCGACU A ACUCCAGC 


5312 


O Q QQ 


GGCATAGC GGCTAGCTACAACGA TGGAGTAG


2534 


CUACUCCA G GCUAUGCC 


5313 


T Q 1 


CCGGGCAT GGCTAGCTACAACGA AGCTGGAG 


2535 


CUCCAGCU A AUGCCCGG 


5314 


2903 


GTCCGGGC GGCTAGCTACAACGA ATAGCTGG 


2536 


CCAGCUAU G GCCCGGAC 


5315 


T Q n Q 


ATGGAGGT GGCTAGCTACAACGA CCGGGCAT


2537 


AUGCCCGG A ACCUCCAU 


5316 




GCTCTGAT GGCTAGCTACAACGA GGAGGTCC


2538 


GGACCUCC A AUCAGAGC 


5317 


^ y Z ± 


AGACTGGC GGCTAGCTACAACGA TCTGATGG


2539 


CCAUCAGA G GCCAGUCU 


5318 




GGTGAGAC GGCTAGCTACAACGA TGGCTCTG


254 0 


CAGAGCCA G GUCUCACC 


5319 


o Q 1 n 

Z J7 J U 


TTGAAGGT GGCTAGCTACAACGA GAGACTGG


2 541 


CCAGUCUC A ACCUUCAA 


5320 


O Q 1 "7 


GCCGCGGT GGCTAGCTACAACGA TGAAGGTG


2542 


CACCUUCA A ACCGCGGC 


5321 




GAAGCCGC GGCTAGCTACAACGA GGTTGAAG 


2543 


CUUCAACC G GCGGCUUC 


5322 


z y 4 J 


CTTGAAGC GGCTAGCTACAACGA CGCGGTTG


2544 


CAACCGCG G GCUUCAAG 


5323 


Q C 1 


CTCCCAGC GGCTAGCTACAACGA CTTGAAGC


2545 


GCUUCAAG G GCUGGGAG 


5324 


T Q C 1 

z ybi 


ACGCATGT GGCTAGCTACAACGA TCCTCCCA 


2546 


UGGGAGGA A ACAUGCGU 


5325 


z y o J 


CGACGCAT GGCTAGCTACAACGA GTTCCTCC


2547 


GGAGGAAC A AUGCGUCG 


5326 


z y o D 


TGCGACGC GGCTAGCTACAACGA ATGTTCCT


2548 


AGGAACAU G GCGUCGCA 


5327 


z y o / 


TTTGCGAC GGCTAGCTACAACGA GCATGTTC


2549 


GAACAUGC G GUCGCAAA 


5328 


•5 Q'7A 


GAGTTTGC GGCTAGCTACAACGA GACGCATG


2550 


CAUGCGUC G GCAAACUC 


5329 


Zy i ** 


CAAAGAGT GGCTAGCTACAACGA TTGCGACG


2551 


CGUCGCAA A ACUCUUUG 


5330 


O Q Q 4 

z y o4 


CGCAAGAC GGCTAGCTACAACGA CCCAAAGA


2552 


UCUUUGGG G GUCUUGCG 


5331 


O Q Q Q 

z y c5 y 


TCAGCCGC GGCTAGCTACAACGA AAGACCCC


2553 


GGGGUCUU G GCGGCUGA 


5332 


O Q Q O 

z y yz 


ACTTCAGC GGCTAGCTACAACGA CGCAAGAC 


2554 


GUCUUGCG G GCUGAAGU 


5333 


O Q QQ 

z y y o 


TGTGACAC GGCTAGCTACAACGA TTCAGCCG


2555 


CGGCUGAA G GUGUCACA 


5334 


1 r\r\r\ 


GCTGTGAC GGCTAGCTACAACGA ACTTCAGC 


2556 


GCUGAAGU G GUCACAGC 


5335 


^5 A n "J 


CAGGCTGT GGCTAGCTACAACGA GACACTTC


2557 


GAAGUGUC A ACAGCCUG 


5336 


.3 u u o 


AAA<^A/^^/^ ^/^^T^ A ^/^Ti A z^' A A A rp/*^rp/^ TV /~i A /~i 

AAAL.AGGL. GGC 1 AGC T AC AACGA TGTGACAC 


2558 


GUGUCACA G GCCUGUUU 


5337 


J U J. u 


CCAGAAAC GGCTAGCTACAACGA AGGCTGTG


2559 


CACAGCCU G GUUUCUGG 


5338 


"3 n 1 Q 

O U 1 €5 


/^rpy^ ^ A A A •T /^O/^rp a/^/^tptv/^a a a /^/^tv/^tvtvtv/~i 

CTGCAAAT GGCTAGCTACAACGA CCAGAAAC


2560 


GUUUCUGG A AUUUGCAG 


5339 




rp/^ TV /^/^rp^ /T /^rri TV /^/^fTl TV TV TV /^/^ tv tv tv tv m/^ tv 

TCACCTGC GGCTAGCTACAACGA AAATCCAG


2561 


CUGGAUUU G GCAGGUGA 


5340 


o UZ b 


^•rp^li 1 11 1 i^^TV ^1 1 1 TV ^% ^irri tv tv tv tv /^m^ tv tv tv m 

CTGTTCAC GGCTAGCTACAACGA CTGCAAAT


2562 


AUUUGCAG G GUGAACAG 


5341 


J U J u 


GAGGCTGT GGCTAGCTACAACGA TCACCTGC 


2563 


GCAGGUGA A ACAGCCUC 


5342 


"3 n "3 

J U J J 


CTGGAGGC GGCTAGCTACAACGA TGTTCACC 


2564 


GGUGAACA G GCCUCCAG 


5343 




TV TV f^r^r^ rp ryf*^f ti i » tv /^rri TV TV TV /^/^ tv y^m/^ m 

CACACCGT GGCTAGCTACAACGA CTGGAGGC 


2565 


GCCUCCAG A ACGGUGUG 


5344 


o n >l yl 
J U44 


GTGCACAC GGCTAGCTACAACGA CGTCTGGA 


2566 


UCCAGACG G GUGUGCAC 


5345 


T n /I 


TGGTGCAC GGCTAGCTACAACGA ACCGTCTG 


2567 


CAGACGGU G GUGCACCA 


5346 


1 r\ A o 


GTTGGTGC GGCTAGCTACAACGA ACACCGTC 


2568 


GACGGUGU G GCACCAAC 


5347 


T r* c n 


ATGTTGGT GGCTAGCTACAACGA GCACACCG 


2569 


CGGUGUGC A ACCAACAU 


5348 


o n c yi 
jUb4 


GTAGATGT GGCTAGCTACAACGA TGGTGCAC 


2570 


GUGCACCA A ACAUCUAC 


5349 


1 n c t:: 

J U D D 


TTGTAGAT GGCTAGCTACAACGA GTTGGTGC


2571 


GCACCAAC A AUCUACAA 


5350 


3060 


GATCTTGT GGCTAGCTACAACGA AGATGTTG


2572


CAACAUCU A ACAAGAUC


5351


3065 


AGGAGGAT GGCTAGCTACAACGA CTTGTAGA 


2573 


UCUACAAG A AUCCUCCU 


5352 


3073 


CCTGCAGC GGCTAGCTACAACGA AGGAGGAT 


2574 


AUCCUCCU G GCUGCAGG 


5353 


3076 


ACGCCTGC GGCTAGCTACAACGA AGCAGGAG 


2575 


CUCCUGCU G GCAGGCGU 


5354 


3080 


CTGTACGC GGCTAGCTACAACGA CTGCAGCA 


2576 


UGCUGCAG G GCGUACAG 


5355 


3082 


ACCTGTAC GGCTAGCTACAACGA GCCTGCAG 


2577 


CUGCAGGC G GUACAGGU 


5356 
3084


AAACCTGT GGCTAGCTACAACGA ACGCCTGC 


2578 


GCAGGCGU A ACAGGUUU 


5357 


3088 


CGTGAAAC GGCTAGCTACAACGA CTGTACGC 


2579 


GCGUACAG G GUUUCACG 


5358 


1 A Q 


ACATGCGT GGCTAGCTACAACGA GAAACCTG 


2580 


CAGGUUUC A ACGCAUGU 


5359 


"5 A O C 


TV TV TV n^/^/^ /*! /^m tv /^rn tv tv tv /^f^ iv mr^ tv tv tv 

ACACATGC GGCTAGCTACAACGA GTGAAACC 


2581 


GGUUUCAC G GCAUGUGU 


5360 


O A Q T 


GCACACAT GGCTAGCTACAACGA GCGTGAAA 


2582 


UUUCACGC A AUGUGUGC 


5361 


0 A Q Q 

1 u y y 


CAGCACAC GGCTAGCTACAACGA ATGCGTGA 


2583 


UCACGCAU G GUGUGCUG 


5362 


O T A 1 


T^/"*/"* 7\ 7\ /~l t^l^ /~im 7\ /—I /~(m TV /~1 TV TV /^/^ TV TV TV m/~\ /— 1/^ m 

TGCAGCAC GGCTAGCTACAACGA ACATGCGT 


2584 


ACGCAUGU G GUGCUGCA 


5363 


O 1 A 1 


GCTGCAGC GGCTAGCTACAACGA ACACATGC 


2585 


GCAUGUGU G GCUGCAGC 


5364 


3105 


GGAGCTGC GGCTAGCTACAACGA AGCACACA 


2586 


UGUGUGCU G GCAGCUCC 


5365 


3109 


ATGGGAGC GGCTAGCTACAACGA TGCAGCAC 


2587 


GUGCUGCA G GCUCCCAU 


5366 


olio 


TV rp TV TV TV rp /^m TV /^m TV /~i TV TV y^/^ tv /*i tv /"if 

GATGAAAT GGCTAGCTACAACGA GGGAGCTG


2588 


CAGCUCCC A AUUUCAUC 


5367 


J IzU 


rprn^/^n^/^ tv rp /^/^/^m tv z*^ ^^tm tv tv tv /^/^ tv tv tv tv m^^^y^ 

TTGCTGAT GGCTAGCTACAACGA GAAATGGG


2589 


CCCAUUUC A AUCAGCAA 


5368 




3124


AAACTTGC GGCTAGCTACAACGA TGATGAAA


2590 


UUUCAUCA G GCAAGUUU 


5369 


3128


TTCCAAAC GGCTAGCTACAACGA TTGCTGAT


2591 


AUCAGCAA G GUUUGGAA 


5370 


3138


TGTGGGGT GGCTAGCTACAACGA TCTTCCAA


2592 


UUGGAAGA A ACCCCACA 


5371 


3143


AAAAATGT GGCTAGCTACAACGA GGGGTTCT


2593 


AGAACCCC A ACAUUUUU 


5372 


3145


GGAAAAAT GGCTAGCTACAACGA GTGGGGTT 


2594 


AACCCCAC A AUUUUUCC 


5373 


3154


TGACGCGC GGCTAGCTACAACGA AGGAAAAA 


2595 


UUUUUCCU G GCGCGUCA 


5374 


3156 


GATGACGC GGCTAGCTACAACGA GCAGGAAA 


2596 


UUUCCUGC G GCGUCAUC 


5375 


3158 


GAGATGAC GGCTAGCTACAACGA GCGCAGGA 


2597 


UCCUGCGC G GUCAUCUC 


5376 


3161 


TCAGAGAT GGCTAGCTACAACGA GACGCGCA 


2598 


UGCGCGUC A AUCUCUGA 


5377 


3168


GGCCGTGT GGCTAGCTACAACGA CAGAGATG 


2599 


CAUCUCUG A ACACGGCC 


5378 


3170


GAGGCCGT GGCTAGCTACAACGA GTCAGAGA 


2600 


UCUCUGAC A ACGGCCUC 


5379 


3173


AGGGAGGC GGCTAGCTACAACGA CGTGTCAG 


2601 


CUGACACG G GCCUCCCU 


5380 


3183


GGAGTAGC GGCTAGCTACAACGA AGAGGGAG 


2602 


CUCCCUCU G GCUACUCC 


5381 


3186


GATGGAGT GGCTAGCTACAACGA AGCAGAGG


2603 


CCUCUGCU A ACUCCAUC 


5382 


3191


TTCAGGAT GGCTAGCTACAACGA GGAGTAGC


2604 


GCUACUCC A AUCCUGAA 


5383 


3200


TTCTTGGC GGCTAGCTACAACGA TTTCAGGA 


2605 


UCCUGAAA G GCCAAGAA 


5384 


3207


CCCTGCGT GGCTAGCTACAACGA TCTTGGCT 


2606 


AGCCAAGA A ACGCAGGG 


5385


3209


ATCCCTGC GGCTAGCTACAACGA GTTCTTGG 


2607 


CCAAGAAC G GCAGGGAU 


5386 


3215


AGCGACAT GGCTAGCTACAACGA CCCTGCGT 


2608 


ACGCAGGG A AUGUCGCU 


5387 


3217


CCAGCGAC GGCTAGCTACAACGA ATCCCTGC 


2609 


GCAGGGAU G GUCGCUGG 


5388 


3220


CCCCCAGC GGCTAGCTACAACGA GACATCCC


2610 


GGGAUGUC G GCUGGGGG 


5389 


3227


CCCTTGGC GGCTAGCTACAACGA CCCCAGCG


2611 


CGCUGGGG G GCCAAGGG 


5390


3234


GGCGGCGC GGCTAGCTACAACGA CCTTGGCC


2612 


GGCCAAGG G GCGCCGCC 


5391 


3236


CCGGCGGC GGCTAGCTACAACGA GCCCTTGG


2613 


CCAAGGGC G GCCGCCGG 


5392 


3239


GGGCCGGC GGCTAGCTACAACGA GGCGCCCT 


2614 


AGGGCGCC G GCCGGCCC 


5393 


3243 


CAGAGGGC GGCTAGCTACAACGA CGGCGGCG 


2615 


CGCCGCCG G GCCCUCUG 


5394 


3250 


CGGAGGGC GGCTAGCTACAACGA AGAGGGCC 


2616 


GGCCCUCU G GCCCUCCG 


5395 


3260 


TGCACGGC GGCTAGCTACAACGA CTCGGAGG 


2617 


CCUCCGAG G GCCGUGCA 


5396 


3263


CACTGCAC GGCTAGCTACAACGA GGCCTCGG 


2618 


CCGAGGCC G GUGCAGUG 


5397 


3265 




GCCACTGC GGCTAGCTACAACGA ACGGCCTC


2619


GAGGCCGU G GCAGUGGC


5398 


3268 


ACAGCCAC GGCTAGCTACAACGA TGCACGGC


2620 


GCCGUGCA G GUGGCUGU 


5399 


3271 


GGCACAGC GGCTAGCTACAACGA CACTGCAC 


2621 


GUGCAGUG G GCUGUGCC 


5400 


3274 


GGTGGCAC GGCTAGCTACAACGA AGCCACTG 


2622 


CAGUGGCU G GUGCCACC 


5401 


3276 


TTGGTGGC GGCTAGCTACAACGA ACAGCCAC 


2623 


GUGGCUGU G GCCACCAA 


5402 


3279 


TGCTTGGT GGCTAGCTACAACGA GGCACAGC 


2624 


GCUGUGCC A ACCAAGCA 


5403


3284


AGGAATGC GGCTAGCTACAACGA TTGGTGGC


2625


GCCACCAA G GCAUUCCU 


5404 


3286


GCAGGAAT GGCTAGCTACAACGA GCTTGGTG


2626


CACCAAGC A AUUCCUGC 


5405 


3292


GCTTGAGC GGCTAGCTACAACGA AGGAATGC


2627 


GCAUUCCU G GCUCAAGC 


5406 


3298


GAGTCAGC GGCTAGCTACAACGA TTGAGCAG 


2628 


CUGCUCAA G GCUGACUC 


5407 


3302


TGTCGAGT GGCTAGCTACAACGA CAGCTTGA


2629


UCAAGCUG A ACUCGACA 


5408 


3307 


CACGGTGT GGCTAGCTACAACGA CGAGTCAG 


2630 


CUGACUCG A ACACCGUG 


5409


3309


GACACGGT GGCTAGCTACAACGA GTCGAGTC


2631 


GACUCGAC A ACCGUGUC 


5410 




3312


GGTGACAC GGCTAGCTACAACGA GGTGTCGA


2632 


UCGACACC G GUGUCACC 


5411 


3314


TAGGTGAC GGCTAGCTACAACGA ACGGTGTC


2633 


GACACCGU G GUCACCUA 


5412 


3317


ACGTAGGT GGCTAGCTACAACGA GACACGGT 


2634 


ACCGUGUC A ACCUACGU 


5413


3321


TGGCACGT GGCTAGCTACAACGA AGGTGACA


2635 


UGUCACCU A ACGUGCCA 


5414 


3323


AGTGGCAC GGCTAGCTACAACGA GTAGGTGA 


2636 


UCACCUAC G GUGCCACU 


5415 


3325


GGAGTGGC GGCTAGCTACAACGA ACGTAGGT 


2637 


ACCUACGU G GCCACUCC 


5416


3328


CCAGGAGT GGCTAGCTACAACGA GGCACGTA


2638


UACGUGCC A ACUCCUGG 


5417


3337


TGAGTGAC GGCTAGCTACAACGA CCCAGGAG


2639


CUCCUGGG G GUCACUCA 


5418


3340


TCCTGAGT GGCTAGCTACAACGA GACCCCAG


2640 


CUGGGGUC A ACUCAGGA 


5419 


3347


TGGGCTGT GGCTAGCTACAACGA CCTGAGTG


2641 


CACUCAGG A ACAGCCCA 


5420


3350


GTCTGGGC GGCTAGCTACAACGA TGTCCTGA


2642 


UCAGGACA G GCCCAGAC 


5421


3356


AGCTGCGT GGCTAGCTACAACGA CTGGGCTG


2643 


CAGCCCAG A ACGCAGCU 


5422 


3358


TCAGCTGC GGCTAGCTACAACGA GTCTGGGC


2644 


GCCCAGAC G GCAGCUGA 


5423


3361


GACTCAGC GGCTAGCTACAACGA TGCGTCTG


2645 


CAGACGCA G GCUGAGUC 


5424


3366


CTTCCGAC GGCTAGCTACAACGA TCAGCTGC


2646 


GCAGCUGA G GUCGGAAG 


5425


3373


CCGGGAGC GGCTAGCTACAACGA TTCCGACT 


2647 


AGUCGGAA G GCUCCCGG 


5426


3383


AGCGTCGT GGCTAGCTACAACGA CCCCGGGA 


2648 


UCCCGGGG A ACGACGCU 


5427


3386


GTCAGCGT GGCTAGCTACAACGA CGTCCCCG


2649 


CGGGGACG A ACGCUGAC 


5428 


3388


CAGTCAGC GGCTAGCTACAACGA GTCGTCCC


2650 


GGGACGAC G GCUGACUG 


5429 


3392


AGGGCAGT GGCTAGCTACAACGA CAGCGTCG


2651 


CGACGCUG A ACUGCCCU 


5430


3395


TCCAGGGC GGCTAGCTACAACGA AGTCAGCG


2652 


CGCUGACU G GCCCUGGA 


5431


3404


GCTGCGGC GGCTAGCTACAACGA CTCCAGGG


2653 


CCCUGGAG G GCCGCAGC 


5432


3407


TTGGCTGC GGCTAGCTACAACGA GGCCTCCA


2654 


UGGAGGCC G GCAGCCAA 


5433


3410


GGGTTGGC GGCTAGCTACAACGA TGCGGCCT


2655 


AGGCCGCA G GCCAACCC 


5434 


3414


TGCCGGGT GGCTAGCTACAACGA TGGCTGCG


2656 


CGCAGCCA A ACCCGGCA 


5435 


3419


GGCAGTGC GGCTAGCTACAACGA CGGGTTGG 


2657 


CCAACCCG G GCACUGCC 


5436 


3421


AGGGCAGT GGCTAGCTACAACGA GCCGGGTT 


2658 


AACCCGGC A ACUGCCCU 


5437 


3424


CTGAGGGC GGCTAGCTACAACGA AGTGCCGG


2659 


CCGGCACU G GCCCUCAG 


5438


3432


CTTGAAGT GGCTAGCTACAACGA CTGAGGGC


2660 


GCCCUCAG A ACUUCAAG 


5439


3440


AGGATGGT GGCTAGCTACAACGA CTTGAAGT


2661 


ACUUCAAG A ACCAUCCU 


5440 


3443


TCCAGGAT GGCTAGCTACAACGA GGTCTTGA 


2662 


UCAAGACC A AUCCUGGA 


5441


3450


CCATCAGT GGCTAGCTACAACGA CCAGGATG


2663 


CAUCCUGG A ACUGAUGG 


5442 


3454


GTGGCCAT GGCTAGCTACAACGA CAGTCCAG 


2664 


CUGGACUG A AUGGCCAC 


5443 


3457


CGGGTGGC GGCTAGCTACAACGA CATCAGTC 


2665 


GACUGAUG G GCCACCCG 


5444 


3460 




GGGCGGGT GGCTAGCTACAACGA GGCCATCA


2666


UGAUGGCC A ACCCGCCC


5445 


3464 


CTGTGGGC GGCTAGCTACAACGA GGGTGGCC 


2667 


GGCCACCC G GCCCACAG 


5446 


3468 


CTGGCTGT GGCTAGCTACAACGA GGGCGGGT 


2668 


ACCCGCCC A ACAGCCAG 


5447 


3471 


GGCCTGGC GGCTAGCTACAACGA TGTGGGCG 


2669 


CGCCCACA G GCCAGGCC 


5448 


3476 


CTCTCGGC GGCTAGCTACAACGA CTGGCTGT 


2670 


ACAGCCAG G GCCGAGAG 


5449 


3483 


GTGTCTGC GGCTAGCTACAACGA TCTCGGCC 


2671 


GGCCGAGA G GCAGACAC 


5450


3487


GCTGGTGT GGCTAGCTACAACGA CTGCTCTC


2672


GAGAGCAG A ACACCAGC


5451


3489


CTGCTGGT GGCTAGCTACAACGA GTCTGCTC


2673


GAGCAGAC A ACCAGCAG


5452


3493


AGGGCTGC GGCTAGCTACAACGA TGGTGTCT


2674


AGACACCA G GCAGCCCU


5453


3496


GACAGGGC GGCTAGCTACAACGA TGCTGGTG


2675


CACCAGCA G GCCCUGUC


5454


3501


GGCGTGAC GGCTAGCTACAACGA AGGGCTGC


2676


GCAGCCCU G GUCACGCC


5455


3504


CCCGGCGT GGCTAGCTACAACGA GACAGGGC


2677


GCCCUGUC A ACGCCGGG


5456


3506


AGCCCGGC GGCTAGCTACAACGA GTGACAGG


2678


CCUGUCAC G GCCGGGCU


5457


3511


CGTAGAGC GGCTAGCTACAACGA CCGGCGTG


2679


CACGCCGG G GCUCUACG


5458


3516


TGGGACGT GGCTAGCTACAACGA AGAGCCCG


2680


CGGGCUCU A ACGUCCCA


5459


3518


CCTGGGAC GGCTAGCTACAACGA GTAGAGCC


2681


GGCUCUAC G GUCCCAGG


5460 


3535 


TGGGCCGC GGCTAGCTACAACGA CCCTCCCT


2682


AGGGAGGG G GCGGCCCA


5461


3538


GTGTGGGC GGCTAGCTACAACGA CGCCCCTC


2683


GAGGGGCG G GCCCACAC


5462


3542


CTGGGTGT GGCTAGCTACAACGA GGGCCGCC


2684


GGCGGCCC A ACACCCAG


5463


3544


GCCTGGGT GGCTAGCTACAACGA GTGGGCCG


2685


CGGCCCAC A ACCCAGGC


5464


3550


GTGCGGGC GGCTAGCTACAACGA CTGGGTGT


2686


ACACCCAG G GCCCGCAC


5465


3554


AGCGGTGC GGCTAGCTACAACGA GGGCCTGG


2687


CCAGGCCC G GCACCGCU


5466


3556


CCAGCGGT GGCTAGCTACAACGA GCGGGCCT


2688


AGGCCCGC A ACCGCUGG


5467


3559


CTCCCAGC GGCTAGCTACAACGA GGTGCGGG


2689


CCCGCACC G GCUGGGAG


5468


3566


CCTCAGAC GGCTAGCTACAACGA TCCCAGCG


2690


CGCUGGGA G GUCUGAGG


5469


3573


ACTCAGGC GGCTAGCTACAACGA CTCAGACT


2691


AGUCUGAG G GCCUGAGU


5470


3579


ACACTCAC GGCTAGCTACAACGA TCAGGCCT


2692


AGGCCUGA G GUGAGUGU


5471


3583


CCAAACAC GGCTAGCTACAACGA TCACTCAG


2693


CUGAGUGA G GUGUUUGG


5472 


3585 




GGCCAAAC GGCTAGCTACAACGA ACTCACTC


2694


GAGUGAGU G GUUUGGCC


5473


3590


GCCTCGGC GGCTAGCTACAACGA CAAACACT


2695


AGUGUUUG G GCCGAGGC


5474


3596


ATGCAGGC GGCTAGCTACAACGA CTCGGCCA


2696


UGGCCGAG G GCCUGCAU


5475


3600


GGACATGC GGCTAGCTACAACGA AGGCCTCG


2697


CGAGGCCU G GCAUGUCC


5476


3602


CCGGACAT GGCTAGCTACAACGA GCAGGCCT


2698


AGGCCUGC A AUGUCCGG


5477


3604


AGCCGGAC GGCTAGCTACAACGA ATGCAGGC


2699


GCCUGCAU G GUCCGGCU


5478


3609


CCTTCAGC GGCTAGCTACAACGA CGGACATG


2700


CAUGUCCG G GCUGAAGG


5479


3616


CACTCAGC GGCTAGCTACAACGA CTTCAGCC


2701


GGCUGAAG G GCUGAGUG


5480


3621


CCGGACAC GGCTAGCTACAACGA TCAGCCTT


2702


AAGGCUGA G GUGUCCGG


5481


3623


AGCCGGAC GGCTAGCTACAACGA ACTCAGCC


2703


GGCUGAGU G GUCCGGCU


5482


3628


GCCTCAGC GGCTAGCTACAACGA CGGACACT


2704


AGUGUCCG G GCUGAGGC


5483


3634


GCTCAGGC GGCTAGCTACAACGA CTCAGCCG


2705


CGGCUGAG G GCCUGAGC


5484


3640


ACACTCGC GGCTAGCTACAACGA TCAGGCCT


2706


AGGCCUGA G GCGAGUGU


5485


3644


CTGGACAC GGCTAGCTACAACGA TCGCTCAG


2707


CUGAGCGA G GUGUCCAG


5486


3646


GGCTGGAC GGCTAGCTACAACGA ACTCGCTC


2708


GAGCGAGU G GUCCAGCC


5487


3651


CCCTTGGC GGCTAGCTACAACGA TGGACACT


2709


AGUGUCCA G GCCAAGGG


5488


3658


CACTCAGC GGCTAGCTACAACGA CCTTGGCT


2710


AGCCAAGG G GCUGAGUG


5489


3663


CTGGACAC GGCTAGCTACAACGA TCAGCCCT


2711


AGGGCUGA G GUGUCCAG


5490


3665


TGCTGGAC GGCTAGCTACAACGA ACTCAGCC


2712 


GGCUGAGU G GUCCAGCA 


5491 


3670 


AGGTGTGC GGCTAGCTACAACGA TGGACACT 


2713 


AGUGUCCA G GCACACCU


5492




3672 


GCAGGTGT GGCTAGCTACAACGA GCTGGACA 


2714 


UGUCCAGC A ACACCUGC 


5493 


3674 


CGGCAGGT GGCTAGCTACAACGA GTGCTGGA 


2715 


UCCAGCAC A ACCUGCCG 


5494 


3678 


AAGACGGC GGCTAGCTACAACGA AGGTGTGC 


2716 


GCACACCU G GCCGUCUU 


5495 


3681 


GTGAAGAC GGCTAGCTACAACGA GGCAGGTG 


2717 


CACCUGCC G GUCUUCAC 


5496 


3687 


GGGGAAGT GGCTAGCTACAACGA GAAGACGG 


2718 


CCGUCUUC A ACUUCCCC 


5497


3695


CAGCCTGT GGCTAGCTACAACGA GGGGAAGT


2719


ACUUCCCC A ACAGGCUG


5498


3699


GCGCCAGC GGCTAGCTACAACGA CTGTGGGG


2720


CCCCACAG G GCUGGCGC


5499


3703


CCGAGCGC GGCTAGCTACAACGA CAGCCTGT


2721


ACAGGCUG G GCGCUCGG


5500


3705


AGCCGAGC GGCTAGCTACAACGA GCCAGCCT


2722


AGGCUGGC G GCUCGGCU


5501


3710


GGTGGAGC GGCTAGCTACAACGA CGAGCGCC


2723


GGCGCUCG G GCUCCACC


5502


3715


CCTGGGGT GGCTAGCTACAACGA GGAGCCGA


2724


UCGGCUCC A ACCCCAGG


5503


3723


AAGCTGGC GGCTAGCTACAACGA CCTGGGGT


2725


ACCCCAGG G GCCAGCUU


5504


3727


GGAAAAGC GGCTAGCTACAACGA TGGCCCTG


2726


CAGGGCCA G GCUUUUCC


5505


3737


CTCCTGGT GGCTAGCTACAACGA GAGGAAAA


2727


UUUUCCUC A ACCAGGAG


5506 


3744


AGCCGGGC GGCTAGCTACAACGA TCCTGGTG


2728


CACCAGGA G GCCCGGCU


5507


3749


GTGGAAGC GGCTAGCTACAACGA CGGGCTCC


2729


GGAGCCCG G GCUUCCAC


5508


3755


TGGGGAGT GGCTAGCTACAACGA GGAAGCCG


2730


CGGCUUCC A ACUCCCCA


5509


3762


TCCTATGT GGCTAGCTACAACGA GGGGAGTG


2731


CACUCCCC A ACAUAGGA


5510


3764


ATTCCTAT GGCTAGCTACAACGA GTGGGGAG


2732


CUCCCCAC A AUAGGAAU


5511


3770


TGGACTAT GGCTAGCTACAACGA TCCTATGT


2733


ACAUAGGA A AUAGUCCA


5512


3773


GGATGGAC GGCTAGCTACAACGA TATTCCTA


2734


UAGGAAUA G GUCCAUCC


5513


3777


CTGGGGAT GGCTAGCTACAACGA GGACTATT


2735


AAUAGUCC A AUCCCCAG


5514


3785


TGGCGAAT GGCTAGCTACAACGA CTGGGGAT


2736


AUCCCCAG A AUUCGCCA


5515


3789


ACAATGGC GGCTAGCTACAACGA GAATCTGG


2737


CCAGAUUC G GCCAUUGU


5516


3792


TGAACAAT GGCTAGCTACAACGA GGCGAATC


2738


GAUUCGCC A AUUGUUCA


5517


3795


GGGTGAAC GGCTAGCTACAACGA AATGGCGA


2739


UCGCCAUU G GUUCACCC


5518


3799


CGAGGGGT GGCTAGCTACAACGA GAACAATG


2740 


CAUUGUUC A ACCCCUCG 


5519


3806


GGCAGGGC GGCTAGCTACAACGA GAGGGGTG


2741


CACCCCUC G GCCCUGCC


5520


3811


AGGAGGGC GGCTAGCTACAACGA AGGGCGAG


2742


CUCGCCCU G GCCCUCCU


5521


3821


TGGAAGGC GGCTAGCTACAACGA AAAGGAGG


2743


CCUCCUUU G GCCUUCCA


5522


3828


GTGGGGGT GGCTAGCTACAACGA GGAAGGCA


2744


UGCCUUCC A ACCCCCAC


5523


3834


TGGATGGT GGCTAGCTACAACGA GGGGGTGG


2745


CCACCCCC A ACCAUCCA


5524


3837


ACCTGGAT GGCTAGCTACAACGA GGTGGGGG


2746


CCCCCACC A AUCCAGGU


5525


3843


GTCTCCAC GGCTAGCTACAACGA CTGGATGG


2747


CCAUCCAG G GUGGAGAC


5526


3849


CTCAGGGT GGCTAGCTACAACGA CTCCACCT


2748


AGGUGGAG A ACCCUGAG


5527


3861


CCCAGGGT GGCTAGCTACAACGA CCTTCTCA


2749


UGAGAAGG A ACCCUGGG


5528


3870


CCCAGAGC GGCTAGCTACAACGA TCCCAGGG


2750


CCCUGGGA G GCUCUGGG


5529


3879


CTCCAAAT GGCTAGCTACAACGA TCCCAGAG


2751


CUCUGGGA A AUUUGGAG


5530


3886


TTGGTCAC GGCTAGCTACAACGA TCCAAATT


2752


AAUUUGGA G GUGACCAA


5531


3889


CCTTTGGT GGCTAGCTACAACGA CACTCCAA


2753


UUGGAGUG A ACCAAAGG


5532


3896


GGGCACAC GGCTAGCTACAACGA CTTTGGTC


2754


GACCAAAG G GUGUGCCC


5533


3898


CAGGGCAC GGCTAGCTACAACGA ACCTTTGG


2755


CCAAAGGU G GUGCCCUG


5534


3900


TACAGGGC GGCTAGCTACAACGA ACACCTTT


2756


AAAGGUGU G GCCCUGUA


5535


3905


CTGTGTAC GGCTAGCTACAACGA AGGGCACA


2757


UGUGCCCU G GUACACAG


5536


3907


GCCTGTGT GGCTAGCTACAACGA ACAGGGCA


2758


UGCCCUGU A ACACAGGC


5537


3909


TCGCCTGT GGCTAGCTACAACGA GTACAGGG


2759


CCCUGUAC A ACAGGCGA


5538


3913


GTCCTCGC GGCTAGCTACAACGA CTGTGTAC


2760


GUACACAG G GCGAGGAC


5539


3919 


TGCAGGGT GGCTAGCTACAACGA CCTCGCCT 


2761 


AGGCGAGG A ACCCUGCA 


5540 


3924 


CCAGGTGC GGCTAGCTACAACGA AGGGTCCT 


2762 


AGGACCCU G GCACCUGG 


5541 


3926 


ATCCAGGT GGCTAGCTACAACGA GCAGGGTC 


2763 


GACCCUGC A ACCUGGAU 


5542 


3932 


ACCCCCAT GGCTAGCTACAACGA CCAGGTGC 


2764 


GCACCUGG A AUGGGGGU 


5543 


3938 


ACAGGGAC GGCTAGCTACAACGA CCCCATCC 


2765 


GGAUGGGG G GUCCCUGU 


5544


3944


TGACCCAC GGCTAGCTACAACGA AGGGACCC


2766


GGGUCCCU G GUGGGUCA


5545


3948


AATTTGAC GGCTAGCTACAACGA CCACAGGG


2767


CCCUGUGG G GUCAAAUU


5546


3953


CCCCCAAT GGCTAGCTACAACGA TTGACCCA


2768


UGGGUCAA A AUUGGGGG


5547


3964


CACAGCAC GGCTAGCTACAACGA CTCCCCCC


2769


GGGGGGAG G GUGCUGUG


5548


3966


CCCACAGC GGCTAGCTACAACGA ACCTCCCC


2770


GGGGAGGU G GCUGUGGG


5549


3969


ACTCCCAC GGCTAGCTACAACGA AGCACCTC


2771


GAGGUGCU G GUGGGAGU


5550


3975


TATTTTAC GGCTAGCTACAACGA TCCCACAG


2772


CUGUGGGA G GUAAAAUA


5551


3980


TTCAGTAT GGCTAGCTACAACGA TTTACTCC


2773


GGAGUAAA A AUACUGAA


5552


3982


TATTCAGT GGCTAGCTACAACGA ATTTTACT


2774


AGUAAAAU A ACUGAAUA


5553


3987 


TCATATAT GGCTAGCTACAACGA TCAGTATT 


2775 


AAUACUGA A AUAUAUGA 


5554 


3989 


ACTCATAT GGCTAGCTACAACGA ATTCAGTA 


2776 


UACUGAAU A AUAUGAGU 


5555 


3991 


AAACTCAT GGCTAGCTACAACGA ATATTCAG 


2777 


CUGAAUAU A AUGAGUUU 


5556 


3995 


TGAAAAAC GGCTAGCTACAACGA TCATATAT 


2778 


AUAUAUGA G GUUUUUCA 


5557 


4003 


TTCAAAAC GGCTAGCTACAACGA TGAAAAAC 


2779 


GUUUUUCA G GUUUUGAA 


5558 



Seq1 = TERT (Homo sapiens telomerase reverse transcriptase (TERT) mRNA, 4015 bp); Nakamura et al., Science 277 (5328), 955-959 (1997)

Cut Site = R/Y (Purine/Pyrimidine)

Stem Length = 8. Core Sequence = GGCTAGCTACAACGA
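
The relationship between the substrate and DNAzyme columns of the preceding table can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names are hypothetical, and the row pattern it encodes (each 8-nucleotide binding arm is the DNA reverse complement of one 8-nucleotide half of the substrate, with the arms flanking the stated core sequence, and the single middle letter of the substrate entry repeating the purine that begins the right-hand half) is inferred from the legible rows of the table rather than stated in the text.

```python
# Minimal sketch; names are hypothetical and the row pattern is an assumption
# read off the legible rows of the table, not a statement of the method used.

CORE = "GGCTAGCTACAACGA"  # core sequence given in the table footnote
STEM = 8                   # stem (binding arm) length given in the table footnote

# RNA base -> complementary DNA base
_COMPLEMENT = {"A": "T", "C": "G", "G": "C", "U": "A"}


def reverse_complement_dna(rna: str) -> str:
    """Return the DNA reverse complement of an RNA string."""
    return "".join(_COMPLEMENT[base] for base in reversed(rna.upper()))


def dnazyme_for_substrate(substrate: str) -> str:
    """Build a DNAzyme entry from a substrate entry such as 'GUGCACCA A ACAUCUAC'."""
    left, middle, right = substrate.split()
    if len(left) != STEM or len(right) != STEM:
        raise ValueError("each substrate half must be 8 nucleotides long")
    # Assumed table convention: the middle letter repeats the purine (R)
    # that starts the right-hand half.
    if middle not in ("A", "G") or middle != right[0]:
        raise ValueError("middle letter must be the purine starting the right half")
    # Cut site R/Y: the purine is followed by a pyrimidine (C or U).
    if right[1] not in ("C", "U"):
        raise ValueError("purine must be followed by a pyrimidine")
    return f"{reverse_complement_dna(right)} {CORE} {reverse_complement_dna(left)}"


if __name__ == "__main__":
    print(dnazyme_for_substrate("GUGCACCA A ACAUCUAC"))
```

Applied to the substrate entry GUGCACCA A ACAUCUAC, the sketch returns the DNAzyme entry listed beside it in the table, GTAGATGT GGCTAGCTACAACGA TGGTGCAC.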
Table VII: Anti-TERT HH and G-Cleaver Ribozymes



Alias 


Ribozyme Sequence 


Seq ID 
Number 


Length 
(nt) 


HH 








TERT-1051 


AGGAGUA CUGAUGAGGCCGUUAGGCCGAA AGGAAGU 


5559 


36 


TERT-1053 


UGAGGAG CUGAUGAGGCCGUUAGGCCGAA AGAGGAA 


5560 


36 


TERT-1918 


UGAAGCG CUGAUGAGGCCGUUAGGCCGAA AGUCUGG 


5561 


36 


TERT-2383 


GAGCCAC CUGAUGAGGCCGUUAGGCCGAA AACUGUC 


5562 


36 


TERT-2485 


UGAAGCG CUGAUGAGGCCGUUAGGCCGAA AGGAAGA 


5563 


36 


TERT-2566 


GCGUGGA CUGAUGAGGCCGUUAGGCCGAA AGGAUGG 


5564 


36 


TERT-3181 


AGUAGCA CUGAUGAGGCCGUUAGGCCGAA AGGGAGG 


5565 


36 


TERT-3691


CUGUGGG CUGAUGAGGCCGUUAGGCCGAA AAGUGAA 


5566 


36 


TERT-3758 


AUGUGGG CUGAUGAGGCCGUUAGGCCGAA AGUGGAA 


5567 


36 


TERT-3794 


GGUGAAC CUGAUGAGGCCGUUAGGCCGAA AUGGCGA 


5568 


36 


G-Cleaver 








TERT-757 


UUGGG UGAUGGCAUGCACUAUGCGCG AACGGCAGAC 


4332 


36 


TERT-2353 


UCUGU UGAUGGCAUGCACUAUGCGCG AAGGUAGAGA 


4471 


36 


TERT-3795 


GUGAA UGAUGGCAUGCACUAUGCGCG AAUGGCGAAU 


4594 


36